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Abstract 

The natural way to use Answer Set Programming (ASP) to represent knowledge in Ar¬ 
tificial Intelligence or to solve a combinatorial problem is to elaborate a first order logic 
program with default negation. In a preliminary step this program with variables is trans¬ 
lated in an equivalent propositional one by a first tool: the grounder. Then, the proposi¬ 
tional program is given to a second tool: the solver. This last one computes (if they exist) 
one or many answer sets (stable models) of the program, each answer set encoding one 
solution of the initial problem. Until today, almost all ASP systems apply this two steps 
computation. 

In this article, the project ASPeRiX is presented as a first order forward chaining ap¬ 
proach for Answer Set Computing. This project was amongst the first to introduce an 
approach of answer set computing that escapes the preliminary phase of rule instantiation 
by integrating it in the search process. The methodology applies a forward chaining of 
first order rules that are grounded on the fly by means of previously produced atoms. 
Theoretical foundations of the approach are presented, the main algorithms of the ASP 
solver ASPeRiX are detailed and some experiments and comparisons with existing systems 
are provided. 

KEYWORDS: Answer Set Programming, solver implementation, grounding on the 
fly, first order, forward chaining. 


1 Introduction 


Answer Set Programming (ASP) is a very convenient paradigm to represent knowl¬ 


edge in Artificial Intelligence (AI) and to encode combinatorial problems (Baral 


2003 Niemela 1999). It has its roots in nonmonotonic reasoning and logic program¬ 


ming and has led to a lot of works since the seminal paper (Gelfond and Lifschitz 


1988). Beyond its ability to formalize various problems from AI or to encode com¬ 


binatorial problems, ASP provides also an interesting way to practically solve such 
problems since some efficient solvers are available. In few words, if someone wants 


* This work was supported by ANR (National Research Agency), project ASPIQ under the ref¬ 
erence ANR-12-BS02-0003. 
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to use ASP to solve a problem, he has to write a logic program in term of rules in a 
purely declarative manner in such a way that the answer sets (initially called stable 
models in (Gelfond and Lifschitz 1988)) of the program represent the solutions of 
his original problem. 


Illustration of ASP formalism Let us take two typical examples for which ASP 
is suitable: the first example is devoted to knowledge representation in Artificial 
Intelligence and the second one is a combinatorial problem. 


KR problem This first example deals with default reasoning on incomplete infor¬ 
mation. It consists in describing knowledge about birds. 
bird{titi). 
ostrich{lola). 
bird{X) <— ostrich{X). 
fly{X) -ir- bird{X),not ostrich{X). 
non_fly{X) ^ ostrich{X). 

The meaning of the two first rules is that we have two objects: titi which is a 
bird and lola which is an ostrich. The meaning of the other rules is that an ostrich 
is a bird, a bird which is not an ostrich flies and an ostrich does not fly. Here, 
we are interested in deducing some properties about titi and lola. Intuitively, we 
want that titi flies, lola is a bird and lola does not fly. Concerning the information 
that lola does not fly, let us notice that it is obtained by applying the last rule 
since lola is an ostrich and, then, the next to last rule cannot be applied in pres¬ 
ence of ostrich lola due to the part not of this rule, called default negation. Here, 
there is only one answer set which contains all the deduced pieces of information: 
{bird(titi), fly{titi), ostrichllola), bird(lola), non_fly(lola)}. 

CSP problem The second example deals with the representation of a combinatorial 
problem: possibles worlds are represented by nonmonotonic “guess” rules and choice 
between these worlds is expressed by constraints. The problem is then to find (at 
least) one solution corresponding to a world verifying the constraints. This example 
is about graph 2-coloring. 
vertex{l). 
vertex{2). 
edge{l,2). 

red{X) ^ vertex{X),not blue{X). 
blue{X) ^ vertex{X),not red{X). 

red{X), red{Y), edge{X, Y). 

^ blue{X), blueiY), edge{X, Y). 

This represents a graph with two vertices and an edge between them (three first 
rules). The two following rules are guess rules. The fourth (resp. fifth) rule means 
that a vertex which is not colored in blue (resp. red) has to be colored in red (resp. 
blue). The two last rules are constraints. They mean that two adjacent vertices 
can not have the same color. Here, we want to find how the two vertices should be 
colored (knowing that two colors are available). Intuitively, we have two solutions: 
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one with vertex 1 colored in blue and vertex 2 colored in red and the other one with 
vertex 1 colored in red and vertex 2 colored in blue. This corresponds to the two 
answer sets of the program: {vertex{l),vertex{2),edge(l,‘i),blue{\),red[2y\ and 
{vertex{\),vertex{‘2), edge{l,2)^red(\),hlue{2y\. However, let us note that, in this 
kind of problem, we are often interested in finding one solution rather than finding 
all the possible solutions (and the determination of only one answer set is enough). 

As regards the form of the rules, we can notice that a program usually contains 
different kind of rules. The simplest ones are facts as {bird{titi).) or {vertex{l).) rep¬ 
resenting data of the particular problem. Some ones are about background knowl¬ 
edge as {bird{X) x- ostrich{X).). Some others can be nonmonotonic as {fly{X) x- 
bird{X),not ostrich(X).) for reasoning with incomplete knowledge. In other cases, 
especially for combinatorial problems, nonmonotonic rules can be used to encode 
alternative potential solutions of a problem as {red{X) x- vertex{X), not blue{X).) 
and {blue(X) x- vertex{X),not red{X).) expressing the two exclusive possibilities 
to color a vertex in a graph. Last, special headless rules are used to represent con¬ 
straints of the problem to solve as red{X),red{Y),edge{X,Y).), here, in order 
to not color with red two vertices linked by an edge. 

With the examples above we can point out that knowledge representation in ASP 
is done by means of first order rules. But, from a theoretical point of view, answer 
set definition is given for propositional programs and the answer sets of a first order 
program are those of its ground instantiation with respect to its Herbrand universe 
(i.e. without variables). The first order program has to be seen as an intensional 
version of the grounded propositional corresponding program. 


ASP systems Concerning the ASP sytems, their main goal is how to compute 
answer sets in an efficient way. Let us recall some of their main features. Until 
today, almost all systems available to compute the answer sets of a program follow 
the architecture described in Fig. 



Fig. 1. Architecture of answer set computation 

An ASP system begins its work by an instantiation phase in order to obtain a 
propositional program (and, as said above, the answer sets of the first order program 
will be those of its ground instantiation). After this first grounding phase realized 
by a grounder the solver starts the real phase of answer set computation by dealing 
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with a finite, but sometimes huge, propositional program. The main goal of each 
grounding system is to generate all propositional rules that can be relevant for a 
solver and only these ones, while preserving answer sets of the original program. 
Current intelligent grounders simplify rules as much as possible. Simplifications can 
lead to compute the unique answer set of some programs (for instance, programs 
that does not contain default negation) but it is no longer possible once the problem 
is combinatorial. Anyway, the grounding phase is firstly and fully processed before 
calling the solver. 


For the grounder box we can cite Lparse (Syrjanen 1998) and Gringo (Gebser 


et al. 2011), and for the solver box Smodels (Simons et al. 2002) and Clasp (Geb¬ 


ser et al. 2012). A particular family of solvers are Assat (Lin and Zhao 2004), 


Cmodels (Giunchiglia et al. 2006) and Pbmodels (Liu and Truszczynski 2005), since 
they transform the answer set computation problem into a (pseudo) boolean model 
computation problem and use a (pseudo) SAT solver as an internal black box. In 
the system DLV (Leone et al. 2006), symbolized in Fig.[^by the dash-line rectangle, 
the grounder ((Calimeri et al. 2008) describes a parallel version) is incorporated 
as an internal function. In the same way, WASP (Alviano et al. 2013) uses the DLV 
grounder (Faber et al. 2012). 


Grounding The main drawback of the preliminary grounding phase is that it may 
lead to a lot of useless work as illustrated in the following examples. 

The first examples illustrate the fact that the separation between the instantiation 
phase and the computation phase can prevent the (efficient) use of information 
relevant to the computation. 


Example 1. Let be the following ASP program: 




a •<— not h., 
b not a., 


' u,., 

p(o)., 

p{X -|- 1) ^ a, p{X). 




Grounding of is infinite (if an upper bound for integers is not fixed) while it 
has a unique (and finite) answer set {b, p(0)}. 

Let be the following ASP program: 


% = < 


P(1)-,P(2).,..., p{N)., 
a not b., 

b not a., 

pa{X) ^ a, p{X)., 
pb{X) ^ b, p{X)., 


aa{X,Y) ^ pa{X), pa{Y), not bb{X,Y)., 
bb{X,Y) pb{X), pb{Y), not cc{X,Y)., > 

cc(A, Y) ^ aa{X, Y), X <Y., 
i — d, 


From the program A^, current grounders generate roughly 2.5 x rules. 


In both programs, because of the constraint (■<— a.) that eliminates from the 
possible solutions every atom set containing a, it is easy to see that rules {p{X-\-l) ^ 
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a, p{X).) for and {pa{X) x- a, p{X).) for are useless since they can never 
contribute to generate an answer set of the corresponding program. In these 
useless rules are infinite {p{X + 1) a,p{X).) while they are “only” large in 

N rules with positive body containing a, like (pa(l) ^ a,p{l).), and then, the 
rules with pa{X) in their positive body are useless too. In defense of the actual 
grounders, their inability to eliminate these particular rules is not surprising since 
the reason justifying this elimination is the consequence of a reasoning taking into 
account the semantics of ASP. Thus, if we want to limit as much as possible the 
number of rules and atoms to deal with, we have not to separate grounding and 
answer set computing. 

Example is a typical situation for planning problems where step * +1 must be 
generated only if the goal is not reached at step i. Such situations are not tractable 
by grounders. That is the reason why the number of steps needed to reach the goal 
(or at least the maximum number of allowed steps) is given as input of planning 
problems (in ASP competition for example). Yet it is rather counterintuitive having 
to know the step number to solve the problem before solving. 

The next example illustrates that the grounding phase generates too much infor¬ 
mation regarding the computation of one answer set. 


Example 2. Let be the program, as given in (Niemela 1999), encoding a 3- 
coloring problem on a vertices graph organized as a bicycle wheel (see below), v 
stands for vertex, e for edge, c for color, col for colored by, ncol for not colored by. 
v{l).,... ,v{N)., c{red)., c{blue)., c{green)., 
e(l,2).,...,e(l,iV)., 
e(2,3).,e(3,4).,...,e(7V,2)., 
col{V,C) v{V), c{C), not ncol{V,C)., 
ncol{y,C) col{y,D), c{C), C^D., 

^e{V,U), col{V,C), col{U,C). 

From current grounders generate about 18fV rules. If N is even then has 
no answer set and if N is odd then it has 6 answer sets. 




Suppose that % has an answer set in which there is col{l, red). Obviously, all the 
N —1 constraints like (<— e{l,U), col{l,red), col{U,red).) for all U e {2,...,iV} 
are necessary because they have to be checked. But, all the other constraints like (^ 
e{l,U), col{l,blue), col{U, blue).), and (^ e{l,U), col{l, green), col{U, green).) 
for all U S {2,..., iV} can be considered as useless since vertex 1 is not colored by 
blue or green. However, all these 2N — 2 constraints have been generated. So, the 
time consumed by this task is clearly a lost time and the memory space used by 
these data could have been saved. Thus, if we are searching for a single answer set, 
a lot of work would be done for nothing since the grounded program contains the 
enumeration of all solutions when only one is searched. 

The last exemple shows that when the number of solutions is very important 
ASP solvers have more difficulty to find one solution due to the grounding phase 
generating a lot of information concerning all the solutions. 

Example 3. Let % be the program, inspired from one given in ( [Niemela 1999[ ), 
encoding the Hamiltonian cycle problem in a vertices complete oriented graph, v 


(Niemela 1999 
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stands for vertex, a for arc, he for in Hamiltonian cycle, nhc for not in Hamiltonian 
cycle, s for start and r for reached. 


%]= < 


s(l)., v{l).,...,v{N)., a{X,Y)^v{X), v{Y)., 

hc{X,Y) ^ s{X), a{X,Y), not nhc{X,Y)., 
hc{X,Y) r{X), a{X,Y), not nhc{X,Y)., 
nhc{X,Y) ^ hc{X,Z), a{X,Y), Y ^ Z., 
nhc{X,Y) ^ hc{Z,Y), a{X,Y), X ^ Z., 
r(J)^hc{X,Y)., 
v{X), not r{X). 




This program has {N — 1)! answer sets. Whatever the number of desired solutions, 
current grounders generate about 2N'^ rules with he predicate as head and about 
2N^ rules with nhc predicate as head. Thus, even if we restrict our attention to the 
computation of one answer set, all the ASP solvers preceded by a grounding phase 
consume a huge amount of time when the graph has a few hundred vertices. 


This previous example illustrates another strange phenomenon. Sometimes, solv¬ 
ing a trivial problem, as finding one Hamiltonian cycle in a complete graph, is 
impossible for ASP systems. This is very counterintuitive since, in whole general¬ 
ity, in combinatorial problem solving the more solutions the problem has, the easier 
it is to find one of them. Again, the bottleneck for ASP systems seems to come from 
the huge number of rules and atoms that are generated in first, delaying and making 
the resolution more difficult than it should be. 

Beyond these particular examples, the point to stress is that grounders generate in 
extension all the search space (for all potential solutions) that they give then to the 
solver. But, this is clearly not the approach of usual search algorithms. A classical 
coloring algorithm does not firstly enumerate, in extension, all possible colorations 
for every vertex in the graph. A finite domain solver makes choices by instantiating 
some variables, propagates the consequences of these choices, checks the constraints 
and by backtracking explores its search space. Following this strategy it instantiates 
and desinstantiates variables describing the problem to solve all along its search 
process. But, it does not build, a priori and explicitly, all the possible tuples of 
variables and constraints representing the problem to solve. That is why we think 
that if we want to use ASP to solve very large problems we have to realize the 
grounding process during the search process and not before it. 

Is is important to notice that few works advocate the grounding of the program 
during the search of an answer set and not by a preprocessing. Some aim at solving 


the grounding bottleneck by combining ASP to constraint programming: (Baselice 


et al. 2005) proposes to reduce the memory requirements for a very specific class of 


programs, i.e. multi-sorted logic programs with cardinality constraints, (Balduccini 


2009) proposes an algorithm to make cooperate an ASP solver and a Constraint 


Logic Programming solver in such a way that ASP is viewed as a specification 
language for constraint satisfaction problems and (Ostrowski and Schaub 2012) de¬ 
scribes the Clingcon system which is a tight cooperation between the ASP solver 
Clasp and the Constraint Programming solver GeCode. The theory solvers (mainly 
arithmetic solvers) forbid instances that are in conflict with the constraints reduc- 











ASPeRiX 


7 


ing by this way the size of the grounding image. Some others works use a forward 
chaining of rules that are instantiated as and when required: GASP (iDal Palu et al. 


2009) and ASPeRiX (Lefevre and Nicolas 2009a Lefevre and Nicolas 2009b)) de¬ 


veloped at the same time, and more recently OMiGA (Dao-Tran et al. 2012). They 
are all based on the notion of computation given in (Liu et al. 2010). GASP is im¬ 
plemented in Prolog and Constraint Logic Programming over finite domains. Each 
rule instantiation and propagation is realized by building and solving a CSP. OMiGA 
is implemented in Java and uses an underlying Rete network for instantiation and 
propagation. ASPeRiX, which is the one presented in this article, is implemented in 
C-|—P. Instantiation and propagation are inspired by previous work realized on the 


DLV grounder which is based on the semi-naive evaluation technique of (Ullman 


1989D . 

Last, concerning a direct handling of first order programs, let us note that there 


exists some works (Gottlob et al. 1996 Eiter et al. 1997 Ferraris et al. 2007 Lin 


and Zhou 2007 Truszczynski 20121 dealing with first order nonmonotonic logic 


programs. These works establish some relations between stable model semantics 
and constraints systems or second order logic or circumscription but they are not 
really concerned by the explicit computation of answer sets. 


The present paper is an extended version of (Lefevre and Nicolas 2009a Lefevre 


and Nicolas 2009b). It details our approach of answer set computation that escapes 


the preliminary grounding phase by integrating it in the search process and includes: 


theoretical foundations of the approach, “mbt ASPeRiX computation”, with 
complete proofs; these computations are based on those of (Liu et al. 2010) 
and include use of constraints and must-be-true propagation in order to guide 
the search; 


• a detailed description of the main algorithms; 

• experimentations of the resulting system, ASPeRiX, and comparisons with 
other similar systems and other “classical” ASP systems. Our methodology 
is particularly well suited for: 


— solving easy problems with a large grounding, 

— finding only one answer set for a program whose search space is large and 
proportional to the desired number of solutions, 

— solving problems for which pre-grounding is impossible because domains 
are inhnite or open, or because some pieces of knowledge come from outside 
(distributed systems for example). 


The paper is organized as follows. In Section we recall the theoretical back¬ 
grounds about ASP necessary to the understanding of our work. In Section we 
present our first order rule oriented approach of answer set computation and its 
implementation in the solver ASPeRiX. In Section experimental results are pre¬ 
sented. We conclude in Section by citing some new perspectives for ASP as a 
result of our innovative approach. Proofs of theorems are reported in [Appendix B[ 
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2 Theoretical Background 

In this section, we give the main backgrounds of ASP framework useful to the 
understanding of this article. 

Set V denotes the infinite countable set of variables, set IFS denotes the set of 
function symbols, set CS denotes the set of constant symbols and set VS denotes 
the set of predicate symbols. It is assumed that the sets V, CS, TS and VS are 
disjoint and that the set CS is not empty. Function ar denotes the arity function 
from TS to N* and from VS to N which associates to each function or predicate 
symbol its arity. Set T denotes the set of terms defined by induction as follows: 

• if u G V then z; G T, 

• if c G CS then c G T, 

• if / G VS with ar{f) = n > 0 and ti,... ,tn G T then /(ti,... ,t„) G T. 

A ground term is a term built over only the two last items of the previous definition. 
The Herbrand universe is the set of all ground terms. Set A denotes the set of atoms 
defined as follows: 

• if a G VS with ar{a) = 0 then a G A, 

• if p G VS with ar{p) = n > 0 and ti,..., G T then p{ti ,..., t„) G A. 

A ground atom is an atom built over only ground terms. The Herbrand base denoted 
A is the set of all ground atoms. 

A normal logic program (or simply program) is a set of rules like 

c ■(—ai, On, not bi, not bm- n>0,m>0 (1) 

where c, ai,..., an, bi,..., bm are atoms. 

The intuitive meaning of such a rule is: ”if all the Oi’s are true and it may be 
assumed that all the bj’s are false then one can conclude that c is true”. Symbol 
not denotes the default negation. A rule with no default negation is a definite rule 
otherwise its is a nonmonotonic rule. A program with only definite rules is a definite 
logic program. A program is a propositional program if all the predicate symbols are 
of arity 0. 

For each program P, we consider that the set CS (resp. VS and VS) consists 
of all constant (resp. function and predicate) symbols appearing in P. These sets 
determine the set of ground terms and the set of ground atoms of the program. 
A substitution for a rule r G P is a mapping from the set of variables from r to 
the set of ground terms of P. A ground rule r' is a ground instance of a rule r if 
there is a substitution 9 for r such that r' = 9 [r), the rule obtained by substituting 
every variable in r by the corresponding ground term in 9. The program P (with 
variables) has to be seen as an intensional version of the program ground(P) defined 
as follows: given a rule r, ground{r) is the set of all ground instances of r and 
then, ground{P) = [J,,^p ground(r). Program ground{P) may be considered as a 
propositional program. Let us note that the use of function symbols leads to an 
infinite Herbrand universe, this point will be discussed in Section [3.5[ 
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Example 4. The program 
( n(l)., n(2)., 

< a{X) X- n{X), not b{X)., 
[ b(X) X- n{X), not a{X). 
is a shorthand for the program 
n(l)., n(2)., 


ground(I^ 


a(l) ^ n(l), 
6(1) ^ n(l), 
(r(2) — 77.(2), 
6(2) ^ 77(2), 


not 6(1)., 
not a(l)., 
not 6(2)., 
not a(2). 


For a rule r (or by extension for a rule set), we define: 


• head(r) = c its head, 

• body~^(r) = {oi,..., a„} its positive body and 

• body~{r) = {6i,..., 6^} its negative body. 

The immediate consequence operator for a definite logic program P is Tp : 2^ —>■ 
2^ such that Tp{X) = {head{r) \ r G P, body^{r) C X}. The least Herbrand 
model of P, denoted Cn{P), is the smallest set of atoms closed under P, i.e., the 
smallest set X such that Tp{X) C X. It can be computed as the least fix-point of 
the consequence operator Tp. 

The reduct P^ of a normal logic program P w.r.t. an atom set X C ^ is the 
definite logic program defined by: 

P^ = {head{r) G- body~'~{r). \ r G P, body~{r) n X = 0} 


and it is the core of the definition of an answer set. 


Definition 1. (Gelfond and Lifschitz 1988) Let P be a normal logic program and 
X an atom set. X is an answer set of P if and only X = Cn{P^). 

For instance, the propositional program {a ^ not 6., 6 ^ not a.} has two answer 
sets {a} and {6}. 

Example 5. Taking again the program ground{I^ has four answer sets: 
{a(l),a(2), 77(1), 77(2)}, {a(l), 6(2), 77 ( 1 ), 77 ( 2 )}, 

{a(2), 6(1), 77 ( 1 ), 77 ( 2 )}, {6(1), 6(2), 77 ( 1 ), 77 ( 2 )} 

that are thus the answer sets of % 

There is another definition of an anwer set for a normal logic program based on 
the notion of generating rules which are the rules participating to the construction 
of the answer set. These rules are important in our approach because they are 
exactly the rules fired in the ASPeRiX computation presented in the next section. 


Definition 2. (Konczak et al. 2006) Let P be a normal logic program and X be 


an atom set. GRp{X), the set of generating rules of P, is defined as GRp{X) = 
{r G P I body'^{r) C X and body~{r) n X = 0}. 


Definition 3. (Konczak et al. 2006) Let P be a set of rules. R is grounded if there 
exists an enumeration of the rules of R such that V? G [I..77], 6oc?7/+(rj) C 

head{{rj | j < zj). 
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The next theorem is inspired by (Konczak et al. 2006). In (Konczak et al. 2006) 


X is an answer set of a program P if and only ii X = Cn{GRp{X) ). It can be 
reformulated by: 


Theorem 1. (Konczak et al. 2006) Let P be a normal logic program and X be an 


atom set. Then, X is an answer set of P if and only if X = head{GRp{X)) and 
GRp{X) is grounded. 

Special headless rules, called constraints, are admitted and considered equivalent 
to rules like {bug ^ , not bug.) where bug is a new symbol appearing nowhere 

else. For instance, the program {o •<— not b., b •<— not a.,-^ a.} has one, and only 
one, answer set {b} because constraint (•<— a.) prevents a to be in an answer set. 

When dealing with default negation, we call a literal an atom, a, or the negation 
of an atom, not a. A literal a is said to be positive, and not a is said to be negative. 
The corresponding atom a of a literal I is denoted by at{l). For a literal I where 
at{l) = a, let us denote pred{l) the function such that pred{not a) = pred{a) = p 
with p the predicate symbol of the atom a. 

For purposes of knowledge representation, one may have to use conjointly strong 
negation (like —^a) and default negation (like not a) inside a same program. This is 


possible in ASP by means of an extended logic program (Gelfond and Lifschitz 1991) 


in which rules are built with classical literals (i.e. an atom a or its strong negation 
^a) instead of atoms only. Semantics of extended logic programs distinguishes in¬ 
consistent answer sets from absence of answer set. But, if we are not interested in 
inconsistent answer sets, the semantics associated to an extended logic program is 
reducible to answer set semantics for a normal logic program using constraints by 
taking into account the following conventions: 


• every classical literal —•x is encoded by the atom nx, 

• for every atom x, the constraint (<— x, nx.) is added. 

By this way, only consistent answer sets are kept. In this article, we do not focus 
on strong negation and literal will never stand for classical literal. 

Let us note that one can also use some particular atoms for (in)equalities and 
simple arithmetic calculus on (positive and negative) integers. Arithmetic opera¬ 
tions are treated as a functional arithmetic and comparison relations are treated as 
built-in predicates. 

Finally, a program P is said to be stratified iff there is a mapping strat from VS 
to N such that, for each ground rule like the two following conditions hold: 

• strat{pred{c)) > strat{pred{ai)) for all i G [l..n] 

• strat{pred{c)) > strat{pred{bj)) for all j G [l..m] 


3 A First Order Forward Chaining Approach for Answer Set 

Computing 

3.1 ASPeRiX Computation 

In this section, a characterization of answer sets for first-order normal logic pro¬ 
grams, based on a concept of ASPeRiX computation, is presented. This concept is 
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itself based on an abstract notion of computation for ground programs proposed 
in (Liu et al. 2010). This computation fundamentally uses a forward chaining of 
rules. It builds incrementally the answer set of the program and does not require the 
whole set of ground atoms from the beginning of the process. So, it is well suited to 
deal directly with first order rules by instantiating them during the computation. 

The only syntactic restriction required by this methodology is that every rule of a 
program must be safe. That is, all variables occurring in the head and all variables 
occurring in the negative body of a rule occur also in its positive body. Note that 
this condition is already required by all standard evaluation procedures. Moreover, 
every constraint (i.e. headless rule) is considered given with the particular head T 
and is also safe. For the moment we do not consider function symbols but their use 
will be discussed in Section [331 

An ASPeRiX computation is defined as a process on a computation state based 
on a partial interpretation which is defined as follows. 


Definition 4. A partial interpretation for a program P is a pair {IN, OUT) of 
disjoint atom sets included in the Herbrand base of P. 


Intuitively, all atoms in IN belong to a search answer set and all atoms in OUT 
do not. 

The notion of partial interpretation defines different status for rules. 


Definition 5. Let r be a ground rule and I = {IN, OUT) be a partial interpreta¬ 
tion. 

• r is supported w.r.t. / when body'^{r) C IN, 

• r is blocked w.r.t. / when body~{r) n IN 0, 

• r is unblocked w.r.t. I when body~{r) C OUT, 

• r is applicable w.r.t. I when r is supported and not blocked^ 

An ASPeRiX computation is a forward chaining process that instantiates and fires 
one unique rule at each iteration according to two kinds of inference: a monotonic 
step of propagation and a nonmonotonic step of choice. To fire a rule means to add 
the head of the rule in the set IN. 


Definition 6. Let P be a set of first order rules, / be a partial interpretation and 
P be a set of ground rules. 

• Apro(-P) I, R) is the set of all supported definite rules and supported unblocked 
nonmonotonic rules from ground{P) \ R. 

• AchoiP, U R) is the set of all applicable nonmonotonic rules from ground{P)\ 
R. 

It is important to notice that the two sets defined above, like the set ground{P), 
do not need to be explicitly computed. It is in accordance with the principal aim 
of this work that is to avoid their extensive construction. When necessary, a first- 
order rule of P can be selected and grounded with propositional atoms occurring 


^ The negation of blocked, not blocked, is different from unblocked. 
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in IN and OUT in order to define a new (not already occurring in R) fully ground 
rule member of Ap^o or Acho- Because of the safety constraint on rules this full 
grounding is always possible. These mechanisms are specified in more details in 
Subsection 3.3 The sets Ap^o and Acho are used in the following definition of an 
ASPeRiX computation. Specific case of constraints (rules with T as head) is treated 
by adding T into OUT set. By this way, if a constraint is fired (violated), T should 
be added into IN and thus, {IN, OUT) would not be a partial interpretation. 


Definition 7. Let P be a first order normal logic program. An ASPeRiX computa¬ 
tion for P is a sequence {Ri,Ii)^Q of ground rule sets Ri and partial interpretations 
li = {INi,OUTi) that satisfies the following conditions: 


• Po = 0 and Iq = (0, {T}), 

• (Revision) Vi > 1, 

(Propagation) Ri = Pi_i U {rj with ri e Ap™(P, /i_i, Pi_i) 
and li = {INi-i U {head{ri)},OUTi-i) 
or (Rule choice) Apro{P, li-i, Ri-i) = 0 , 

Ri — Ri— 1 U with ri ^ A^^oA— i? Pi—i) 
and li = {INi-i U {head{ri)}, OUR-i U body~{ri)) 
or (Stability) Ri = Ri-i and A = U-i, 

• (Convergence) 3i > 0, Acho{P, h, Ri) = 0 - 


The computation is said to converge to the set INao = UZoIN.- 


Example 6. Let % be the following program: 


n(l). 

nix + 1) ^ n{X), (X + 1) <= 2. 

< a{X)n{X),not b{X),not b{X A-1). > 

b{X) -p- n{X), not a{X). 

. c{X)n{X),not b{X A-1). 
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The following sequence is an ASPeRiX computation for 

h =(0,U}) 

ri = n(l). e Ap^o(% Jo,0) 

h =({n(l)},{T}) 

r 2 = n(2) ^ n(l). S A pro (%/i,{ri}) 

h =({n(l),n(2)},U}) 

^pro {%l2,{'>'i,r2}) = 0 

rs = a(l) ^ n(l), not 5(1), not 5(2). e Ac/jo(% h, {ri, r 2 }) 
h =({n(l),n(2),a(l)},{T,5(l),5(2)}) 

Ti = c(l) ^ n(l), not 5(2). e Apro(% h, {t"!, t’ 2 , J'a}) 
h = ({n.(l),n(2),a(l),c(l)},{_L,5(l),5(2)}) 

Apr-o(%-f4,{ri,r2,r3,r4}) = 0 

rs = a(2) ^ n(2), not b{2),not 5(3). € IXcho{I^ h, {ri, r 2 , ra, r4}) 
h = ({n(l), n(2), a(l), c(l), a(2)}, {T, 5(1), 5(2), 5(3)}) 

re = c(2) ^ n.(2), not 5(3). € Apro(% h, {ri,r 2 , re, n, rs}) 
le = ({n,(l),n(2),a(l),c(l),a(2),c(2)},{_L,5(l),5(2),5(3)}) 

Apr-o(%4,{ri,r2,r3,r4,r5,r6}) = 0 
^cho (%4,{?’i,r2,r3,r4,r5,r6}) = 0 

I? = le 

The previous ASPeRiX computation converges to the set 
{n-(l), n(2), a(l), c(l), a(2), c(2)} which is an answer set for 

The following theorem establishes a connection between the results of any ASPeRiX 
computation and the answer sets of a normal logic program. 

Theorem 2. Let P be a normal logic program and X be an atom set. Then, X 
is an answer set of P if and only if there is an ASPeRiX computation {Ri, 
li = {INi,OUTi), for P such that INao = X. 

Let us note that in order to respect the revision principle of an ASPeRiX com¬ 
putation each sequence of partial interpretations must be generated by using the 
propagation inference based on rules from Apro as long as possible before using 
the choice based on Acho in order to hre a nonmonotonic rule. Then, because of 
the non determinism of the selection of rules from Acho, the natural implementa¬ 
tion of this approach leads to a usual search tree where, at each node, one has to 
decide whether or not to fire a rule chosen in Acho- Persistence of applicability of 
the nonmonotonic rule chosen to be fired is ensured by adding to OUT all ground 
atoms from its negative body. On the other branch, where the rule is not fired. 
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the translation of its negative body into a new constraint ensures that it becomes 
impossible to find later an answer set in which this rule is not blocked. 

Propagation can be improved by using “must-be-true’|^ atoms: atoms which have 
to be in the answer set to avoid a contradiction or, in other words, atoms already 
determined to be in IN but which are not yet be proved to be in. 

Example 7. Let (_L ^ not b.) be a constraint whose body contains only one literal 
not b with b ^ IN U OUT. In order to have an answer set, b must be in IN so that 
the constraint is not applicable but b is not yet proved (it is not the head of a fired 
rule). Thus, one can only conclude that b must be true. 

Must-be-true atoms can be used during the propagation step in order to reduce 
the search space. 

Example 8 . Let (c a,b.) be a rule with a G IN and b ^ IN but b has been 
determined to be a must-be-true atom. The rule may be fired during the propagation 
step but one can only conclude that the rule head c must be true (because b is not 
yet proved). 

Must-be-true atoms can also be used to reduce the size of Acho, the set of nom- 
monotonic rules that can be chosen to be fired. 

Example 9. Let (c G- a, not d.) be a rule with a G IN and d ^ IN but d has been 
determined to be a must-be-true atom. The rule may already be considered to be 
blocked, even if d is not yet proved, and thus may be excluded from Acho- 

Note that must-be-true atoms are first used to improve propagation and choice 
but have to be proved later, otherwise the computation can not lead to an answer 
set. 

Notions of partial interpretation, rule status and ASPeRiX computation can be 
modified in order to consider these new elements. 

Definition 8 . Let P be a logic program. A mht partial interpretation for P is a 
triplet {IN, MBT, OUT) of disjoint atom sets included in the Herbrand base of P. 

Definition 9. Let r be a ground rule and I = {IN, MBT, OUT) be a mbt partial 
interpretation. 

• r is supported w.r.t. / when body'^{r) C IN, 

• r is weakly supported w.r.t. I when body^{r) C (IN U MBT) 

• r is blocked w.r.t. I when body~{r) n {IN U MBT) ^ 0, 

• r is unblocked w.r.t. I when body~{r) C OUT, 

• r is applicable w.r.t. I when r is supported and not blocked. 

Propagation is extended by Mbt-propagation: if some rule is weakly supported 
and unblocked w.r.t. mbt partial interpretation {IN, MBT, OUT) (but is not sup¬ 
ported, i.e., does not belong to Ap^o), then the head of the rule can be added in 
MBT set. And Acho, the set of rules that can be chosen, is restricted to the rules 
that are not blocked w.r.t. mbt partial interpretation. 


^ The term “must be true” is first used in (Faber et al. 1999 I 
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Definition 10. Let P be a set of first order rules, I = {IN, MBT, OUT) be a mbt 
partial interpretation and P be a set of ground rules. 

• Apro{P,I,R) = {r G ground{P) \ R \ body~^{r) C IN and body~{r) C OUT} 

• Apro_mbt{P,I,R) = {r G ground{P)\R \ body^{r) C INU MBT, body'^{r) % 
IN and body~{r) C OUT} 

• I^cho.mbt{P, I , R) = {r G ground{P) \ R \ body+{r) C IN, and body-{r) n 
{IN U MBT) = 0} 

A mbt ASPeRiX computation is an ASPeRiX computation with this additional kind 
of propagation and with the possibility to block a rule from Acho.mbt instead of hring 
it (“Rule exclusion”). To block a rule is to add a constraint with the negative literals 
of the rule body. If there is only one literal in the negative body, this constraint can 
be expressed by adding an atom in MBT set (see Example]^. These possibilities 
restrict rule choice in Acho.mbt and thus forbid some computations: if a rule r is 
blocked, computation can only converge to an answer set whose generating rules 
do not contain r. Note that Convergence principle impose that, at the end of a 
computation, no constraint is applicable and each atom from MBT set has been 
proved (i.e., was moved from MBT to IN set). 

Definition 11. Let P be a hrst order normal logic program. A mbt ASPeRiX com¬ 
putation for P is a sequence {Ki, R^, of ground rule sets Ki and Rt and mbt 

partial interpretations A = {INi, MBTi,OUTi) that satisfies the following condi¬ 
tions: 

• Ko = %, Rq =% and /q = (0 , 0, {T}), 

• (Revision) Vf > 1, 

(Propagation) Ki = ATj-i, 

Ri Ri —1 C with Ti G Apj~q{P, Ii_\, Ri_\) 
and li = {INi-i U {head{ri)}, MBTi-i \ {head{ri)},OUTi-i) 
or (Mbt-propagation) Ki = Ki-i, Ri = Ri-i, 

and li = {INi-i,MBTi-i U {head{ri)},OUTi-i) 
with Ti G Apco_mbt{Pi Ii — li Ri—l) 
or (Rule choice) Apro{P U ATi-i, A-i, P^-i) = 0, 

Apro.mbt (PC Ki—i , li—l, Ri— i ) = 0, 

K, = 

Ri — Ri—1 C with Ti G Acho_mbt{P'i li—l^ Ri—l) 

and li = {INi-i\j{head{Ti)},MBTi-i\{head{ri)},OUTi-iVJbody~{Ti)) 
or (Rule exclusion) Apro(PU ATi-i, A-i,Pi_i) = 0, 

Apro.mbt (PC Ki—i , li—l, Ri— i ) = 0, 

Ki — Ki—\, Ri — Ri—\ 

and h = {IN^-i, MBTi-i U body~{Ti),OUTi-i) 
with Ti G Acho.mbt{PjIi-i,Ri-i) and \body-{ri)\ = 1 

or Ki Ki—\ U '[T ^ ^bCibody~ ^'} 1 Ri Ri — 1 and A li—l 

with Ti G Acho.mbt{P, li-i, Ri-i) and \body-{ri)\ > 1 
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or (Stability) Ki = Ki-i, Ri = Ri-i and R = R-i, 

• (Convergence) 3i > 0, Acho.mbt{P U Ki,Ii, Ri) = 0 and MBR = 0. 

Mbt ASPeRiX computations characterize answer sets of a normal logic program. 
Completeness and correctness are established by the following theorem. 

Theorem 3. Let P be a normal logic program and X be an atom set. Then, X is 
an answer set of P if and only if there is a mbt ASPeRiX computation {Ki, Ri, Ii)°^Q, 
h = {IN,, MBT,, OUTi), for P such that IN^o = X. 

Note that computations model only successful! branches of a search tree. On the 
other hand, must-be-true atoms and rules blocking enable to prune failed branches 
of the tree and to reduce non determinism of the search by restricting the possible 
choices for the oracle (because some rules are explicitly excluded, and others are 
blocked by must-be-true atoms). So, these new elements do not improve the number 
of steps of a computation but they improve the number of steps needed to find a 
computation when there is no oracle to guide the search and, then, they make easier 
the search of answer sets. 


3.2 ASPeRiX Main Algorithm 

Now, we are interested in the practical computation of an answer set. The ASPeRiX 
algorithm, following the principle of mbt ASPeRiX computation seen in section [3d] 
is based on the construction of three disjoint atom sets IN, MBT and OUT during 
the search for an answer set. It alternates two steps. On the one hand, a propagation 
step which instantiates all supported and unblocked rules which may be built from 
IN, MBT and OUT and fires them, i.e. adds their head in IN (or MBT). On the 
other hand, a choice step which forces or prohibits a nonmonotonic instantiated 
applicable rule to be fired during the next propagation step. 

In order to treat the information more efficiently, the rules of a program P are 
ordered following the strongly connected components (SCC) of the dependency 
graph of P: the nodes of the dependency graph of a program P are its predi¬ 
cate symbols and the arcs are defined by {{p,q)\3r G P, p = pred{head{r)), q G 
pred{body^{r) Chody~{r))}. The strongly connected components {Ci, ...,Cn} are 
ordered in such a way that if z < j then no node (i.e. predicate symbol) of Ci de¬ 
pends of a node of Cj . A rule is said to belong to a SCC C if the predicate symbol 
of its head is in the component C. Note that constraints are not really concerned 
by ordering of rules but, for standardizing notations, constraints are considered to 
belong to a unique component whose number is greater than that of the last SCC, 
i.e., if Cn is the last SCC then constraints are considered to belong to Cn+i- 

Example 10. (Example continued) 

The strongly connected components (SCC) of the graph of the program are 
Cl = {n}, C 2 = {a, b} and C 3 = {c} (Figure]^. 

The ASPeRiX algorithm solves one by one the SCC {Ci, ...,Cn} of a program 
P by starting by Ci. When no propagation nor choice can no longer be done on 
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Fig. 2. Dependency graph of 


the current SCC, the predicate symbols of the SCC are said to be solved and 
the SCC too. It means that nothing can be deduced anymore for those predicate 
symbols. The atoms which are instances of the predicate symbols of the current 
SCC and which are not in IN are implicitly added to OUT. Note that they are not 
explicitly added to OUT because ground instances of a predicate are not known 
(not computed): they could be infinite and, even if finite, to compute and store 
them is useless. 

Rules of the program are instantiated on the fly during the propagation phase and 
the choice step. Hence the propositional program ground{P) which contains all the 
instantiated rules of the program is never really computed. The propagation step 
and the choice step are realized in the ASPeRiX algorithm thanks to the functions 
7 pro and ^cho (which are selection functions in Ap^o U Apro_mbt and Acho.mbt sets 
used in the mbt ASPeRiX computation of Subsection 3.1). The jpro function searches 
for a weakly supported unblocked rule amongst the current and next non-solved 
SCC. So propagation operates on several components: each rule is fired as soon as 
possible to quickly detect a possible conflict. Rules instantiated by jpro are stored 
in a set Subst = {subst(ri),subst{rn)} during the answer set search to mark 
the substitution of rules that have already been used. For each first-order rule rj, 
subst(ri) denotes the set of all substitutions 9 such that 9{ri) has already been fired. 
And substjrule{ri) = UsGsu&st(’’ instantiated rules obtained 

thanks to substitutions subst{ri). The notation is extended to a set R of first- 
order rules: substjrule{R) = \J^_^p^substjrule(ri). The ^cho function chooses an 
applicable rule in the current SCC when nothing can no longer be propagated. So 
choice, unlike propagation, operates only on the current component. This strategy, 
consisting of solving the SCC one after another, makes it possible to solve efficiently 
stratified programs (or some stratified parts of programs). 

Functions 7pro and ^cho are specified in more details in Subsection 3.3 and are 
defined informally as follows: 


• XproiR, S, S',T, SCC, Subst): nondeterministic function which selects a rule 
(or a constraint) r belonging to a SCC greater or equal to the current SCC in 
the dependency graph of a program P such that body~^ (r) C SUS', body~ (r) C 
T and r € ground{P)\subst_rule{P) or returns NULL if no such a rule exists. 

• JchaiP, S, S', T, SCC, Subst): nondeterministic function which selects a rule r 
belonging to the current SCC in the dependency graph of a program P such 
that body^(r) C S, body~ {r)r\{Syj S') = 0 and r G ground{P)\subst.rule{P) 
or returns NULL if no such a rule exists. 
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Algorithm 1: solve 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 


Function solve Pk, IN, MBT, OUT, SCC, Subst); 
/ / search of one answer set for a program P = Pr U Pk 
repeat //Propagation phase 

ro ^ lpro{PR U Pk, IN, MBT, OUT, SCC, Subst); 
if ro ^ NULL then 

if {body^{ro) n MBT) ^ 0 then 
I MBT ^ MBT U {/lead(ro)}; 
else 

IN ^ IN U {head{rQ)}; 
if {head{rQ) G MBT) then 
[ MBT ^ MBT\{head{rQ)}; 


12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 


until ro = NULL; 

if {{IN U MBT) n OUT / 0) then //Contradiction detected 
I return no-answerset; 

else 

ro ^ IchoiPR, IN, MBT, OUT, SCC, Subst); 
if rg 7 ^ NULL then //Choice point 

stop G- solve{Pii, Pk, IN, MBT, OUT U body~ (rg), SCC, Subst); 
if stop = no-unswerset then 

atoms G- {a|a G body~ {ro),pred{a) G pred{SCC)}; 
if (|atoTOs| = 1) then 
I MBT G- MBT U atoms; 
else 

t Pk ^ Pk ^ {-L ^ Ca iGatoms 


25 


stop solve{PR^ Pk-> in, MBT, OUT, SCC, Subst); 


26 

27 

28 

29 

30 

31 

32 

33 

34 

35 


I return stop ; 
else // The SCC is solved 

if pred{MBT) D pred{SCC) = 0 then 
if Nast{SCC) then 

I return solve{PR, Pk, IN, MBT, OUT, SCC + 1, Subst); 
else 

if JcheckiPK, IN,MBT,0UT, SCC) then // a constraint is 
violated 

I return no-answerset; 
else // An answer set has been found 

^ return IN; 


36 

37 


else //a MBT atom can not be proved 
1^ return no-answerset; 
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The function solve of Algorithm specifies the algorithm of the search of one 
answer set for a program P. The set Pk is the set of constraints (rules with the 
symbol T at their heads) of P and Pr the other rules. By default, T is into the set 
OUT. Then, if a constraint is fired, a contradiction is immediately detected since 
T is added into the set IN and the sets IN and OUT are no longer disjoint. The 
algorithm of the function solve computes one answer set (or none if the program is 
incoherent) thanks to the variable stop which stops the search once an answer set 
has been found. This algorithm may be easily extended to compute an arbitrary 
number of answer sets. Let us note that, for sake of simplicity, the function solve 
will return either a set (when there is an answer set) or the constant no^answer_set 
if there is no answer set. 

The main parts of the function solve are now described. Initially, IN = 0, 
MET = 0, OUT = {T}, SCO is the index of the first SCC and Subst = 0. 

The propagation phase successively hres each weakly supported and unblocked in¬ 
stantiated rule tq. At each step, the call 7pro(-Pi?U^if, IN, MET, OUT, SCC, Subst) 
selects and instantiates a unique unblocked rule Tq such that body'^{rQ) C IN U 
MET (line|^. If such a rule exists, its head atom head{rQ) must belong to the 
answer set. This head atom is added into the set IN (line[^ if the positive body 
of the rule is included in the set IN oi added into the set MET (line[^ otherwise 
since at least one atom a of the positive body of the rule has not yet proved its 
membership to the set IN {a G MET but a ^ IN). Moreover, a head atom which 
is added into IN must be deleted from MET since a proof of its membership to 
the answer set has been found (line[IO|). When there is no more unblocked rule tq 
such that body~^{ro) C IN U MET, {IN U MET) n OUT = 0 is checked in order 
to detect a contradiction (line 131. If no contradiction is detected, the algorithm 
begins the choice step. 

The choice point forces or forbids a nonmonotonic applicable rule to be fired. The 
call jcho{PR, IN, MET, OUT, SCC, Subst) selects and instantiates a unique 
applicable rule of Pr whose head belongs to the current SCC (line 16). If such 
a rule exists, vq is forced to be unblocked and then will be fired during the next 
propagation phase: its negative body is added to the OUT set and function solve 
is recursively called with its new parameters (line 18). If a recursive call to the 
function solve detects a contradiction, the algorithm backtracks on the last choice 
point on the rule vq which has been forced to be fired and blocks it (lines [l9||^ ): 
if a is the only atom of the negative body of vq then a is added to the set MET 
(line [2^ else a constraint including all the atoms of the negative body of vq is added 
to the program (line[24|). More precisely, the only atoms of the negative body that 
are considered are those with a predicate symbol belonging to the current SCC 
because atoms from a lower SCC are already solved, i.e. they are in IN or OUT. 
When there is no more choice point, the current SCC is solved (line[27| but it must 
be checked that no atom of the MET set has a predicate symbol in the current 
SCC (line [2^. If such an atom exists, MET and OUT sets are not disjoint. Indeed, 
if a SCC is solved, atoms which are instances of predicate symbols of the SCC 
and which are not \n. IN are implicitly added to OUT. Then if a MET atom is 
an instance of a predicate symbol of the current SCC, a failure is observed and 
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the backtrack process continues (line 36). If the last SCC is solved, the set IN 
represents an answer set of P if no constraint is applicable. This test is realized 
thanks to the nondeterministic function "fcheck (line |32[ ) which is specified in more 
details in Subsection |3.3| and is defined informally as follows: 

lcheck{P-i S, S',T, SCCy. function which checks if there is any constraint c such 
that body~^{c) C S, body~{c) C S = % and c G ground{P). 


Example 11. The execution of the ASPeRiX algorithm for program i^of Examplej^ 




({n(l),n(2)}.0,{±}) 
no choice point 
SCCl solved 


({n(l),n(2)},0,{l}) 


choice CPI: 

a(l) ■(— 6(l)i7ioi 6(2). 



{{a(l),c(l),n(l).n(2)}.0,{6(l),6(2),X}) 
choice CP2: 

a(2) 77(2).Tiof b(2),not 6(3). 



({a(l),a(2),c(l),c(2),n(l),n(2)}, 

0,{6(l),6(2),t(3).l}> 

no choice point 
SCC2 solved 


<{a(l),a{2),c(l),c{2),n(l),n(2)}, 

0.{Ml).6(2).b(3),-L}> 

no choice point 
SCC3 solved 
SUCCESS 


{{a(l),c(l),n(l),n(2)}, 
0,{6(1),6(2),!}) 


choice CPS: 

6(2) ^ 77(2),77oi a(2). 



({a(l),6(2),c{l),n(l),n(2)}. 
0,{a(2),6(l),6(2),l}) 
b(2) into IN and OUT 
FAILURE 


({a(l),dl),n(l),n(2)}, 
{a(2)},{6(l),6(2),±}) 
no choice point 
SCC2 cannot be solved 
{MBT^ 0 ) 

FAILURE 


Fig. 3. The tree-shaped execution of the answer sets of program 
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is represented by a tree in FigureAt the beginning IN = 0, MBT = 0, OUT = 
{_L} and the current SCC is the component Ci = {n}. After the first propagation, 
n(l) and u(2) are in IN thanks to the two rules ( 7 t,( 1 ).) and (u(2) ^ n(l), (1 + 1) <= 
2.). No choice point exists and the first SCC is solved since the MBT set is empty. 
The component C 2 = {a, b} becomes the current SCC. 

The first choice is realized on the current SCC (choice point CPI): the rule 
(a(l) n{l),not 6 ( 1 ),not 6 ( 2 ).) becomes unblocked by adding 6 ( 1 ) and 6 ( 2 ) into 
the set OUT (left branch after choice point CPI). A new propagation phase shows 
that a(l) and c(l) are in IN since (a(l) ^ n(l),not 6(1),not 6(2).) and (c(l) ^ 
n(l),not 6(2).) can be fired. Then, a new choice is realized (choice point CP2) and 
the rule (a(2) ^ n(2), not 6(2), not 6(3).) is forced to be unblocked (left branch after 
choice point CP2). The atom 6(3) is added into the set OUT. A new propagation 
phase shows that a(2) and c(2) are in IN since (a(2) n(2),not b{2),not 6(3).) 
and (c(2) ^ n(2),not 6(3).) can be fired. The second SCC is solved since no other 
rule is applicable and the MBT set is still empty. In the same way, no propagation 
nor choice point is possible in the SCC C 3 = {c}. Since no constraint is applicable, 
a first answer set is obtained: {a(l), a( 2 ), c(l), c( 2 ), n(l), n( 2 )}. 

If another answer set is wished, the algorithm backtracks to the last choice point 
on (a(2) ^ n{2),not 6(2), not 6(3).) of the component C 2 and blocks the rule (right 
branch after choice point CP2) by adding a constraint (T x- not 6(2), not 6(3).) into 
Pk- a new choice is realized (choice point CPi) and the rule ( 6 ( 2 ) ^ n( 2 ), not a{2).) 
is forced to be unblocked (left branch after choice point CPS) by adding a(2) into 
the OUT set. During the propagation step, 6(2) is added into the IN set since 
(6(2) ^ n{2),not a(2).) is fired. The atom 6(2) is then simultaneously in the sets 
IN and OUT which leads to a contradiction. 

The algorithm backtracks to the choice point (6(2) ^ n{2),not a(2).) of the 
component C 2 (choice point CP3) and the rule is blocked by adding the atom 
a(2) into the MBT set (right branch after choice point CP3). Since there is no 
more possible choice and the MBT set contains an atom whose predicate symbol 
is in the current SCC, this atom cannot be proved and this leads to a failure. The 
algorithm backtracks to the first choice point on (a(l) ^ n(l),not 6 ( 1 ),not 6 ( 2 ).) 
of the component C 2 (choice point CPI) and blocks the rule and searches for a 
new possible answer set (right branch after choice point CPI). The process keeps 
going until the whole tree is computed when all the answer sets are required. Let 
us note that when dealing with the computation of one answer set like explained 
in the algorithm, only the first branch is considered. 


3.3 Functions 7 

Functions 7 have a crucial role in two important steps of the search of an answer set. 
The function jpro is called during the propagation step in order to choose the rules 
to fire and then to add their heads into IN (or MBT). The function 'ycho is called 
during the choice step in order to force or to forbid a rule to be fired during the next 
propagation step. The function jcheck is called during the verihcation step in order 
to verify that no constraint is applicable. Since the principle of the solver ASPeRiX 
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is to instantiate the rules on the fly during the search of an answer set, functions 
7 need to call a function instantiateRule which searches for a substitution for the 
atoms of a rule. This function is detailed in its own Subsection 13.41 


Function ■jpro- The function ■jpro searches for a rule to fire w.r.t. IN, MBT and 
OUT sets. This function computes a complete instantiation of a rule such that 
the positive body is in IN U MBT and the negative body is in OUT. The rule 
to instantiate is chosen amongst a set of rules R consisting of rules that could 
lead to new, unprocessed instances. These rules are those whose body contains 
some predicate symbol of what we call an atom to propagate. Atoms to propagate 
are atoms recently added into IN, MBT and OUT sets, and not yet used for 
propagation phase. Thereby, when an atom a is added into IN or MBT (resp. 
OUT) set, the rules containing pred{a) in their positive body (resp. negative body) 
will be in the R set for the next call to ■jpro in order to propagate this atom, 
i.e. to use its presence in IN or MBT (resp. OUT) for building new instances of 
rules to be fired. During the first call of the function solve, atoms to propagate 
are the facts of the program, and the set R contains all the rules which have some 
predicate symbols of the facts in their positive body. During a call after a choice 
point, atoms to propagate are those added into OUT during this choice point, and 
the set R contains all the rules which have some predicate symbols of these atoms in 
their negative body. During a call after the access to the next SCC, the predicate 
symbols of the current SCC are solved and then all instances of these predicate 
symbols that are not in IN are implicitly added into OUT. Atoms to propagate 
are all these instances determined to be false, and then the set R contains all the 
rules which have in their negative body some of these solved predicate symbols. 

The Algorithml^of the function jpro chooses a first-order rule r amongst the set R 
(the first one, line™ and tries to find a weakly supported unblocked instantiation of 
the rule. It calls the function instantiateRule which returns this next instantiation 
if any (line[^. If there is no more weakly supported unblocked instantiated rule 
which may be extracted from r (line , jpro deletes from R the rule r and treats 
the next rule. This process is repeated until a weakly supported unblocked rule is 
found or there is no more rule in R. When a rule allows a substitution (line 10), 
the latter is stored in Subst in order to find some others at the next call to jpro- 


Example 12. Examplej^is taken again. An answer set is searched after the choice 
point on the rule (a(l) ^ n{l),not b{l),not 5(2).) (choice point CPI): the atoms 
5(1) and 5(2) are added into the set OUT in order to force the rule to be fired 
(left branch after choice point CPI). During the propagation step, many calls to 
the function jpro are executed. During the first call the set R consists of all the 
rules containing in their negative body the predicate symbol 5 of the atoms 5(1) 
and 5(2) that must be propagated. This set R then contains the rules (a(A') ^ 
n{X),not b{X),not b{X + 1).) and (c(A) n{X),not b{X -\- 1).) Arbitrarily, the 
rule (a(X) ^ n{X),not 5(A), not b{X + 1).) of the set R is chosen and a supported 
unblocked instantiation (a(l) ^ n{l),not 5(1),not 5(2).) is found. The function 
Jpro returns the instantiation of the rule and the solve function adds a(l) into IN. 
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Algorithm 2: 'jpro 


1 

2 

3 

4 

5 


6 

7 

8 

9 

10 


11 

12 

13 

14 


Function 7 pro(-P, MBT^ OUT^ SCC^ Suhst)\ 

R ^ Set of rules (including constraints) containing predicate symbols to 
propagate; 
if A 7 ^ 0 then 
repeat 

r ^ first{R)\ 

/* Searching for an instaintiation of the rule r with 

body^{r) C IN U MET and body~{r) C OUT */ 

^ instantiateRule{r, jpro, dN^ MBT^ OUT^ subst{r))] 

li 9 = NULL then 
L R^R\{r}; 
until 0 ^ NULL orR = 9; 
ii 0^ NULL then 

/* An unblocked weakly supported instantiated rule is found 
*/ 

subst{r) ^ subst{r) U {0}; 
return 0 (r); 
else 

1^ return NULL; 


15 else 

16 1^ return NULL; 


During the next call to ^pro, the set R must contain, in addition to the previous 
rules, any rule containing in its positive body the predicate symbol a of the atom 
to be propagated a(l) (since a(l) has been added into IN). Since no rule respects 
this condition, the set R still contains only the two previously added rules. The 
function ^pro searches for a new weakly supported unblocked instantiation of the 
rule (a(X) ^ n(X),not b(X),not b(X + 1).). No such instantiation is found and 
the rule is deleted from the set R. The function jpro searches for a new weakly 
supported unblocked instantiation of the rule (c(X) ■(— n(X),not b(X + 1).). The 
instantiation (c(l) •<— n(l),not 6 ( 2 ).) is then returned to the solve function which 
adds c(l) into IN. Then during the next call to the function the set R must 
be updated with the rules containing in their positive body the predicate symbol 
c of the atom to propagate c(l). As previously, no rule respects this condition and 
the set R still contains the only rule (c(A) ^ n{X),not b{X + 1).). A new weakly 
supported unblocked instantiation is sought but this rule leads to a failure. The 
rule (c(A) ^ n{X),not b{X + 1).) is then deleted from the set R which becomes 
empty. Then the function "fpro returns the value NULL and the propagation step 
of the function solve stops. 
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Function 'fcho- The function "fcho is executed when no rule can be fired anymore 
and there is some SCC to be solved. This function searches an applicable instanti¬ 
ated rule belonging to the current SCC. The Algorithmj^of function ^cho is similar 
to the algorithm of the function jpro- The function "fcho searches for an applicable 
instantiated rule amongst a set R of rules which have in their negative body at least 
one predicate symbol from the current SCC (otherwise, if all predicate symbols from 
negative body belong to previous SCC, they are already solved and then the rule 
can be considered as a monotonic one and is only used for propagation). The func¬ 
tion "fcho chooses a rule in this set R before calling the function instantiateRule 
searching for the next applicable instantiation for the considered rule. In a similar 
way as the function "fpro, the process is repeated until an applicable instantiated 
rule is found for a rule of R or there is no more rule in R. 


Algorithm 3: "fcho 


1 

2 

3 

4 

5 


6 

7 

8 


Function "fcho{P, INi MBT, OUT, SCC, Subst); 

R ^ Set of rule belonging to the current SCC such that the negative body 
contains at least a predicate symbol not solved; 
if i? ^ 0 then 
repeat 

r ^ first{R)-, 

/* Searching for an instauitiation of the rule r with 

body'^ir) C IN auid body~ {r) C {IN U MBT) = 0 
9 ^ instantiateRule{r, "fcho, IN, MBT, OUT, subst{r)); 

ii 9 = NULL then 

L R^R\{r}; 


*/ 


9 

10 

11 

12 

13 

14 


nntil 9 ^ NULL or i? = 0; 
ii 9^ NULL then 

/* An applicable instantiated rule is found 
subst{r) ^ subst{r) U {0}; 
retnrn 9{r); 
else 

1^ return NULL; 


*/ 


15 else 

16 1^ return NULL; 


Example 13. Exampleis taken again. After the first SCC has been solved, a first 
choice is realized on the current SCC, C 2 = {a, 6}, by the function "fcho- The rules of 
this component which contains in their negative body at least one predicate symbol 
a or 6 of C 2 are added into the set R of the rules that may be chosen. Then, the rules 
(a(A) ^ n{X),not b{X),not b{X -j- 1).) and {b{X) ^ n{X),not a{X).) are in R. 
Arbitrarily, the function "fcho searches for an applicable instantiation of the first rule 
of this set and a choice point on (a(l) n(l), not 6(1), not 6(2).) is returned to the 
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calling function solve (choice point CPI). After the propagation step, jcho searches 
for a new applicable instantiation of the rule (a(Ar) ^ n{X),not b{X),not 5(X+1).) 
and a choice point on (a(2) ^ n{2),not b(2),not 6(3).) is returned to the calling 
function solve (choice point CP2). After a new propagation step, 'ycho searches in 
vain a new applicable instantiation of the rule {a{X) x- n{X),not b{X),not b{X + 
1).) This last rule is then deleted from the set R and "fcho searches for an applicable 
instantiation of the rule (6(X) n{X),not a{X).) which leads to a failure. The set 
R is now empty and the function ^cho returns NULL to the calling function solve 
to mean that no other choice may be realized on the current SCC. 

Function "/check • The function "/check is executed when no more choice point is 
possible for the last SCC. This function verifies that no constraint containing at 
least one predicate symbol of the last SCC is applicable in order to determine if 
the set IN is an answer set. The Algorithm of the function "/check is similar to 
the algorithm of the function "/cho- The function "/check searches for an applicable 
instantiated constraint amongst a set C of constraints whose negative body contains 
at least a not-solved predicate symbol of the last SCC. The function "/check chooses 
a constraint in the set C and calls the function instantiateRule which searches 
for an applicable instantiated constraint. If no instantiated constraint is applicable, 
the algorithm returns false and the set IN is an answer set of the program. If a 
constraint is applicable, the algorithm returns true which means there is a failure 
on the branch (the search of answer sets keeps going on other branches if any). 


3-4 Rule Instantiation 


In this section is described the process of instantiation of a rule. This process is a 
lazy one only called when needed. Since we only consider safe rules, the instantiation 
of a rule is in fact the instantiation of its positive body. In a forward chaining 
approach, the only rule instantiations of interest are those that lead to a not blocked 
supported rule or an unblocked weakly supported rule. Hence, the rule instantiation 
is mainly directed by the instantiated atoms already present in the sets IN and 
MET. 

The algorithm used in the ASPeRiX solver and described below is inspired by the 


previous work realized on the DLV grounder (Faber et al. 2012 Perri et al. 20071 


which is based on the semi-naive evaluation technique of (Ullman 1989). The goal 


is to find a substitution for all the literals of the body of a rule r thanks to the 
atoms already in IN^ MET or OUT. To do this, a partial substitution 9 is built as 
possible values are found for the variables of the literals of the body of the rule r. It is 
assumed that the literals h, I 2 , . ■., In of the body of the rule r are ordered following 
a list [li,l 2 ,... ,ln]- firstLiteral{r) (resp. lastLiteral{r)) corresponds to h (resp. 
In) and previous Liter al{r) (resp. nextLiteral{r)) corresponds to the literal which 
precedes (resp. follows) the literal under consideration in the list. The substitution 
calculus for a literal I of a rule r is realized thanks to the functions first Match 
and nextMatch. These functions look for a substitution which has not already been 
computed, i.e. not leading to a substitution for r present in the set suhst{r) of all 








26 


Claire Lefevre, Christopher Beatrix, Igor Stephan, Laurent Garcia 


Algorithm 4: Function jcheck 


1 

2 

3 

4 

5 


6 

7 

8 


Function j^heckiP, IN, MBT, OUT, SCC)-, 

C ^ Set of constraints such that the negative body contains at least a 
predicate symbol not solved; 
if C 7 ^ 0 then 
repeat 

c •<— first{C); 

/* * Searching for an instauitiation of the constraint c such 
that body^(c)QIN and body~ (c) (11N = 0 */ 

9 ^ instantiateRule{c, 'ycheck, IN, MBT, OUT, 0); 
ii 9 = NULL then 
L C' ^ C'\{c}; 


9 

10 

11 

12 

13 


until 9 yf NULL orC = %■, 
it 9^ NULL then 

/* An applicable instantiated constraint is found 
return true; 
else 

1 ^ return false; 


*/ 


14 else 

15 1^ return false; 


substitutions 9 such that 9{r) has already been fired. If the literal I is positive, 
a substitution such that the substituted atom is in the set IN (or IN U MBT) 
is searched. If the literal is negative, (a) a substitution such that the substituted 
corresponding atom is in the set OUT is searched if the goal is an unblocked rule or 
(b) the non membership of the substituted atom to the set IN U MBT is checked 
if the goal is a not blocked rul^ 

In the functions first Match and nextM atch which follow, the parameter 7 shows 
if an unblocked weakly supported or not blocked supported rule is looked for. 

• firstMatch{l,9,^,IN,MBT,OUT,subst) is a function which searches for 
the first possible substitution for a literal I w.r.t. the sets IN, MBT and 
OUT, selection criterion 7 (unblocked weakly supported or applicable rule) 
and the current partial substitution 9. firstMatch returns true and updates 
the partial substitution 9 in case of success. Otherwise, the function returns 
false. 

• nextMatchil, 9, 7 , IN, MBT, OUT, subst) is a function which searches for the 
next possible substitution for literal I given the already realized substitutions. 

For a rule r, a free variable of a literal I is an occurrence of a variable X such that it 
is its first occurrence in the body of r when starting traversing the literal 1. In other 

® In this case, the body of the rule is ordered in such a way that negative literals appear after the 
positive literals containing their variables. 
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words, no other literal which precedes I in the body of r contains an occurrence of 
the variable X. During the instantiation of a rule, a possible substitution is sought 
for all the free variables of every traversed literal and the substitutions of the 
previously calculated variables are kept. If a literal has no free variable, the validity 
of the substitution w.r.t. the selection criterion 7 is checked (i.e. the substituted 
corresponding atom 0{at{l)) is in IN or IN U MBT if I G body'^{r) and 9{at{l)) is 
in OUT or 9{at{l)) is not in IN if I € body~(r)). 

Example 14. Let (a(X, Y, Z) x- b{X, Y),c{X, Y), d{X, Z).) be a rule. The ordered 
list of the body of the rule is [h = b{X, Y), I 2 = c{X, Y), I 3 = d{X, Z)] with: 

• freeVariables{li) = {X,Y} 

• freeVariables{l 2 ) = 0 

• freeVariables{l 3 ) = {Z}. 

Function instantiateRule of Algorithm specifies the instantiation principles of 
a rule for constant sets IN, MBT and OUT. This function is initialized with the 
partial substitution 9 which is the last found substitution (thanks to the function 
lastSubstitution) for the rule r if any (line[^. If it is the first attempt for the instan¬ 
tiation of this rule, 9 is empty (line[^ and the function searches a first substitution 
for the first literal of the body of the rule r using the function firstMatch. Other¬ 
wise, a substitution for r has already been computed (line[^, the function searches a 
new possible substitution for the rule. For this, the function searches the next possi¬ 
ble instance of the last literal of the rule r by deleting from 9 the substitutions of the 
free variables of this literal (thanks to the function freeVariableSubstitutions) and 
by calling the function nextMatch. During the execution of the main loop, the func¬ 
tion first checks if a substitution has been found for the current literal a (line[TT|). 
If it is the case, it searches a first substitution for the next literal of the rule body 
respecting the partial substitution 9. When all the atoms have been considered, 
a complete substitution is found (line |15[ ). The function returns this substitution. 
When the instantiation of a literal fails (i.e. there is no possible substitution for it), 
the function backtracks on the previous literal (line and updates 9 by deleting 
the substitutions of the free variables of this literal. Hence the function calls the 
function nextMatch which searches the next possible instantiation for this literal. 
The instantiation of a rule r fails when no more substitution is possible for the first 
literal (line [ 2 ^. 

Actually, the instantiation algorithm of a rule is slightly more complicated than 
the Algorithm since the atoms dynamically added into IN, MBT and OUT 
sets during the answer set computation, called atoms to propagate, have to be 
taken into account: if possible, each substitution has to be computed once and only 
once. Hence ASPeRiX uses a queue called propagate-!N (resp. propagate-MBT and 
propagatc-OUT) which contains the atoms to be added into the set IN (resp. MBT 
and OUT). When all the instances of a rule r, for given sets INq, MBTq and OUTq, 
have been generated, an atom to propagate Op whose predicate symbol p appears 
in the body of the rule r is extracted. Now Ii = {INi, MBTiOUTi) denotes the 
mbt partial interpretation obtained by adding Op into Iq = {INq,MBTq,OUTq). 
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Algorithm 5: instantiateRule 


1 

2 

3 


4 

5 

6 


7 

8 
9 


Function instantiateRule(r, 7 , IN, MBT, OUT, subst); 

9 ^ lastSubstitutionlr)-, 

if 6 * = 0 then 

/* Searching for the first possible substitution of first 
literal 

I ^ f irstLiteral{r); 

matchFound ^ firstMatch{l, 9, 7 , IN, MBT, OUT, subst); 
else 

/* Searching for the next possible substitution of last 
literal 

I ^ lastLiteral{r); 

9 ^ 9\freeVariableSubstitutions{l); 

matchFound nextMatch{l, 9, 7 , IN, MBT, OUT, subst); 


*/ 


*/ 


10 

11 

12 

13 

14 

15 


while true do 

if matchFound then 

if I 7 ^ lastLiteralir) then 
I ^ next Liter al{r); 

matchfound ^ firstMatch{l, 9, 7 , IN, MBT, OUT, subst) 
else 


16 


/* A complete substitution is found 
return 9; 


*/ 


17 


18 

19 

20 
21 
22 
23 


else 

/* No substitution for literal 1. Bactrack to previous 

literal (if any) to find its next possible substitution 
*/ 

if I 7 ^ firstLiteral{r) then 
I ^ previousLiteral{r); 

9 ^ 9\freeVariableSubstitutions{l); 

matchFound ^ nextMatchll, 9, 7 , IN, MBT, OUT, subst); 

else 

1^ return NULL; 


The body of the rule is ordered in such a way that the first literals are those whose 
predicate symbol is the one of the atom to propagate Up (they are the literals 
that might unify with Op). Then these literals whose predicate symbol is p are 
successively marked and placed at the beginning of the rule. The marked literal 
might only take the value of the atom to propagate Op whereas the following (non 
marked) literals might take any values in R. Then, if the instantiation of the first 
literal fails, it is unmarked, the next literal of predicate p becomes the first literal 
of the rule body and is marked in turn, and the instantiation of the rule is started 
again. The unmarked literals might then take any values in Iq (which excludes the 
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values of Cp already used) while the marked literal can only take the value of the 
atom to propagate, and the non marked literals always take their values in Ii. If 
the instantiation of the first literal fails and there is no other literal to be marked, 
the instantiation of the rule fails. 

Example 15. Let tq be a rule and INq, propagateJN and INi be sets of atoms 
defined as follow: 

ro = a{X + Y)^ a(X), b(X, Y),a(Y). 

IXo = {b(l,l),Hl,2)} 
propagateJN = {a(l)} 

/iVi ={6(1, !),&(!, 2), a(l)} 


a{X + Y) 

^ a{X), a{Y), 

b{X,Y) 



*** 

first call to 

instantiateRule *** 


(1.1) 

a(i; 

1 

- 

(Zi marked) 

(1.2) 

a(i; 

1 a(l) 

- 


(1.3) 

a(i; 

1 a(l) 

6(1,1) 

=^- complete instantiation 


*** 

second call 

to instantiateRule *** 


(2.1) 

a(i; 

1 a(l) 

NO 


(2.2) 

a(i; 

1 NO 

- 


(2.3) 

NO 

- 

- 

failure 

(2.4) 

- 

a(l) 

- 

{I 2 marked) 

(2.5) 

NO 

a(l) 

- 

^ failure 


Table 1. Decomposition of the instantiations of the rule rg for the atom to propagate 
a(l) (Example [T^ 


The atom a(l) has to be propagated by instantiating the rule rg. Table shows 
the different steps of the instantiation. The literals to be marked (whose predicate 
symbol is a) of the body of the rule are a{X) and a{Y). These literals are placed 
at the beginning of the body of rg like this: [li = a{X),l2 = o-iX), I3 = b{X, E)]. In 
Table for clarity, the sequence of literals of the rule body is not changed when the 
marked literal changes. But the marked literal (shown in bold) is processed first, 
which is the same. The first attempt for an instantiation begins and for the first 
time with atom to propagate a(l). The literal h = a{X) is then marked and takes 
as unique value that of the atom to propagate a(l) ((1.1) Table[^. Hence, value 1 
is substituted to the variable X va 0. Then, the following literal in the body of the 
rule, I 2 = a(T), becomes the current literal and takes as value the first amongst 
those into INi which is also a(l). Hence, value 1 is substituted to the variable Y 
in 0 ((1.2) Table[^. Then the last literal, Z 3 = b{X,Y), is reached. This literal has 
no free variable and the membership into INi is simply checked for 6 ( 1 , 1 ) which is 
obtained from b{X, E) by substituting X and E by the values in 9 ((1.3) Table [^. 
There is no more literal to consider then a complete substitution has been found. 
The atom of the head a{X + Y) takes the values of the substitution 6. Hence, the 
forward chaining algorithm can add a( 2 ) into the propagateJN queue. 

Now, during a new instantiation attempt of the rule for the atom to propagate 
a(l), the function restarts with the last substitution of the rule 9 = {X/1,E/1} in 
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order to find a new substitution for the literal I3 = b{X,Y). The second attempt 
for an instantiation begins with atom to propagate a(l) for the second time. Since 
b(X,V) has no free variable, there can be no other substitution than the current 
one ((2.1) Table [^. The process then backtracks to the literal I 2 = a(^) which 
has no other substitution in INi (a(2) has been inferred after a(l) and is not into 
the current set INi) ((2.2) Table [^. Since literal C — a(X) can only take the 
value a(l), it also fails ((2.3) TableQ. Since the last literal has failed, the literal 
h = a(^) is now marked instead of a{X), and is instantiated with the atom to 
propagate a(l). Hence, value 1 is substituted to the variable F in 0 ((2.4) Table[^. 
Literal li = a{X) is unmarked and can only take the values of the atoms of INq, 
thus no substitution is possible. Hence the algorithm fails on the first literal ((2.5) 
Table [^. Since there is no more literal to be marked, the rule instantiation ends by 
a failure for the atom to propagate a(l). The sets becomes as follow: 

/iVo = {6(l,l),6(l,2),a(l)} 
propagate-IN = {a(2)} 

/7Vi = {6(l,l),6(l,2),a(l),a(2)} 


a{X + Y) c 

- a{X), a(Y), b{X,Y) 

*** third call to instantiateRule *** 


(1.1) 

a(2) - 

(Zi marked) 

(1.2) 

a(2) a(l) - 


(1.3) 

a(2) a(l) NO 


(1.4) 

a(2) a(2) - 


(1.5) 

a(2) a(2) NO 


(1.6) 

a(2) NO 


(1.7) 

NO 

=> failure 

(1.8) 

a(2) - 

{I 2 marked) 

(1.9) 

a(l) a(2) - 


(1.10) 

a(l) a(2) 6(1,2) 

*** fourth call to instantiateRule *** 

=> complete instantiation 

(2.1) 

a(l) a(2) NO 


(2.2) 

NO a(2) 


(2.3) 

NO 

=> failnre 


Table 2. Decomposition of the instantiations of the rule tq for the atom to propagate 
a(2) (Example [T^ continued) 


The next atom a(2) is extracted from the queue to propagate. The third attempt 
for an instantiation of tq begins with atom to propagate a(2) for the first time. 
Table shows the different steps of the instantiation. The literals a{X) and a{Y) 
are again to be marked. The rule instantiation is restarted with the literal h = a{X) 
which is the marked literal. The variable X is substituted by the value 2 since the 
only allowed value is that of the atom to propagate o(2) ((1.1) Table[^. The current 
literal is now I 2 = a{Y) where Y is substituted by the value 1 since a(l) is into INi 
((1.2) Table[^. The literal b{X,Y) has no free variable and since the atom 6(2,1) 
which respects the substitution 9 = {X/2,F/1} is not in INi, the literal b{X,Y) 
has no possible substitution ((1.3) Table[^. Then a new instantiation for I 2 = a{Y) 
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is sought: its next possible value is 2 (since a(2) is in INi) ((1.4) Table [^. Again, 
since the atom b{2,2) which respects the substitution 9 = {X/2,y/2} is not in 
INi, the literal b{X,Y) has no possible substitution ((1.5) Table [^. The process 
backtracks to the literal I 2 = a{Y) which has no possible value ((1.6) Table [^. 
Hence, the process backtracks to the literal h = a(X) which has no possible value 
since the only possible value was that of the atom to propagate a(2) ((1.7) Table[^. 
Since the first literal has failed, the process restarts by marking the second literal 
a(Y) (and unmarking the first a{X)). The marked literal I 2 = a{Y) is processed 
first, it substitutes Y by the value 2 of the atom to propagate a{2) ((1.8) Table [^. 
The unmarked literal li = a{X) may only take its values into INq. The variable 
X is then substituted by the value 1 ((1.9) Table [^. The literal I 3 = b{X,Y) has 
no free variable and since b{l, 2 ) which respects the substitution 0 = {X/ 1 ,Y/ 2 } 
is in INi a complete substitution is found ((1.10) Table [^. The atom a{X + Y) 
of the head takes then the value of the substitution 6 . Hence, the forward chaining 
algorithm can add a(3) into INi and into propagateXN. 

Then, during a new instantiation attempt of the rule rg, the atom to propagate 
is still a(2). The process restarts from the last substitution 9 = {A/l,y/2} and 
search for a new substitution for the literal = b{X, Y). A fourth attempt for an 
instantiation begins with atom to propagate a(2) for the second time. Since b{X, Y) 
has no free variable, there can be no other substitution than the current one ((2.1) 
Table [^. The process then backtracks to the literal h = a{X) that has no other 
substitution since the only possible values are those from INq (then neither the 
atom to propagate a(2) nor a(3) appeared after o(2) are possible) ((2.2) Table [^. 
The literal I 2 = a{Y) also fails since the marked literal only accepts the value of the 
atom to propagate a(2) ((2.3) Table[^. Since there is no more literal to be marked, 
the instantiation of the failing rule ends for this atom to propagate. The process 
continues with the atom a(3) which also leads to a failure. 


3.5 ASPeRiX language 


The core language of ASPeRiX is that of normal logic programs (Gelfond and Lifs- 


chitz 1988) with function symbols and true (or strong) negation without inconsistent 


answer set. ASPeRiX also provides dedicated treatment of lists with built-in predi¬ 


cates, as in DLV-complex (Calimeri et al. 2008), an extension of DLV with lists and 


sets. On the other side, ASPeRiX does not provide aggregate atoms and optimiza¬ 


tion statements (Buccafurri et al. 2000) which are accepted by the main current 
systems. 

One of the important issues in ASP is the treatment of function symbols. Un¬ 
interpreted function symbols are important because they enable representation of 
recursive structures such as lists and trees. But reasoning becomes undecidable if 
no restriction is enforced. A lot of work has been made for identifying program 
classes for which reasoning is decidable (Alviano et al. 20TI] |Alviano et al. 2010 


ICalimeri et al. 2011[ |Lierler and Lifschitz 2009 Baselice and Bonatti 2010{ Greco] 


et al. 2013). 


The inherent difficulty with functions in general (and arithmetic in particular) 
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in the framework of ASP is that it makes the Herbrand universe infinite in whole 


generality. ASP grounders Lparse (Syrjanen 19981 and versions up to 3.0 of Gringo 
(Gebser et al. 2007) accept programs respecting some syntactic domain restrictions 
and are able to deal with some restricted versions of functions. 


DLV grounder (Faber et al. 2012) and Gringo (since version 3.0) (Gebser et al. 


2011 ) only require programs to be safe and can deal with all programs having a 


finite instantiation. DLV guarantees finite instantiation for finitely ground programs 
but membership in this class is not decidable. It integrates a Finite Checker module 
which can check if a program belongs to a sub-class of finitely ground programs 
(argument-restricted programs). For programs that are not member of this sub¬ 
class, answer sets can be computed without preliminary check but ending is not 
guaranteed. 

ASPeRiX can deal with these programs and with some other programs whose in¬ 
stantiation is infinite but whose answer sets are finite. For example, the program 
from Section is not finitely ground: intelligent instantiation of the program must 
be finite to be finitely ground. The key points of intelligent instantiation are that 
rules are instantiated with atoms appearing in head of rules of the program, and 
simplifications are performed relatively to facts and rule heads of preceding compo¬ 
nents of the dependency graph. In example choice between a and b makes both 
possible for the grounder, and constraint has no effect on intelligent instantiation of 
the program. Thus, the grounding of rules from will be the same with or with¬ 
out the constraint (^ a.): infinite in both cases. ASPeRiX halts on and is thus 
able to halt on non finitely ground programs but it is not able to verify in advance 
if answer sets are finite or not, and thus if computation will end or not. Never¬ 
theless, ending can be guaranteed by means of command-line options specifying 
the maximum allowed nesting level for functional terms and the biggest admissible 
integer (DLV grounder provides similar possibilities). These restrictions ensure that 
our computations always converge to an answer set if it exists. Formalizing the class 
of programs for which ASPeRiX halts will be the subject of a forthcoming work. 


4 Experimental results 

Following Algorithm of Section |3.2[ the solver called ASPeRiX has been im¬ 
plemented in C-|--|- and is available at http://www.info.univ-cingers.fr/pub/ 
Claire/asperix. 


There are two other ASP systems, GASP (Dal Palu et al. 2009) and OMiGA (Dao- 


Tran et al. 2012), that realize the grounding of the program during the search of 


an answer set. 

GASP is an implementation in Prolog and Constraint Logic Programming over 


finite domains of the notion of computation (see Section 3.1). The main ideas are 


the same as those of ASPeRiX. Notable differences are the following. Well founded 
consequences of the program are computed first. Then propagation is close to ours. 
GASP does not deal with must-be-true atoms but two special cases of propagation, 
not treated by ASPeRiX, are implemented: (a) if the head of a rule is known to 
be in OUT set and the body of the rule is satisfied except for one positive literal. 
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then this literal must be false (added to OUT) and (b) if, for some undefined atom 
a, there is no applicable rule whose head is a, then a can be added to OUT. For 
each rule, instantiation and propagation are realized by building and solving a CSP 
that determines atoms derivable from the rule. Representation of interpretations 
uses Finite Domain Sets, such a data structure is efficient to represent compactly 
intervals but it need to code tuples (instances of predicates) by integers (very big 
integers if domain is large and arity of predicate too). This representation impose 
the set of ground terms of the program to be finite and thus function symbols are 
excluded. On the other hand, GASP supports some cardinality constraints. To our 
knowledge, GASP remained at the prototype stage and is no longer developed. 


DMiGA is implemented in Java. Functional symbols (of non-zero arity) are not 
supported. Principles of propagation and choice are the same as those of ASPeRiX 
but implementation uses Rete algorithm for improving the speed of propagation. 
First order rules are represented by a Rete network. Each node represents a literal 
(or a set of literals) from the body of a rule or the atom of the head of a rule. It 
stores all instances of the node that are true w.r.t. current partial interpretation. 
Thereby all partial instantiations of rules are stored in the network. This lead to an 
efficient propagation regarding computation time, but memory space is sacrified. 
Dependency graph and solved predicates seems to be treated in a similar manner to 
that of ASPeRiX. Current version (Weinzierl 2013) uses must-be-true propagation 
and tries to introduce methods for conflict-driven learning of non-ground rules: 
when a constraint is violated, a new constraint is built by unfolding of rules whose 
firing contributes to the conflict. This learned constraint is then transformed into 
special rules so as to be used for propagation. 


In the following we give some results of evaluation of ASPeRiX 0.2.5 highlighting 
its adequacy to some particular problems. It is compared with Clingo (composed 


by Gringo 3.0.5 and Clasp 1.3.10) (Gebser et al. 2011 

Gebser et al. 2012) 

DLV Dec 16 2012 ( 

Leone et al. 2006), GASP (June 2009) ( 

Dal Palu et al. 2009 


and OMiGA Dec 3 2012 ( |Dao-Tran et al. 2012 ). Version without learning is used 
for OMiGA because learning lowers its performances. All the systems have been run 
on an Intel Core i7-3520M PC with 4 cores at 2.90GHz and about 4GB RAM, 
running Linux Ubuntu 12.04 64 bits. For each instance of a problem, the memory 
usage is limited to 3.000MB and computation time to 600 seconds. RunLiml.7 is 
used for these limitations tasks. Tables of results use OoM (resp. OoT) to indicate 
Out of Memory (resp. Out of Time). Results for GASP are only given for the first 
two examples, because it does not accept other tested programs. 


Schur problem The Schur number problem is to partition N numbers into M 
sets such that all of the sets satisfy: if x and y are assigned to the same set, then 
X + y is not in the set. The following program (Dal Pain et al. 2009) is for M = 3 
sets and N = 4 numbers. 
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number(l)., number{2)., number(3)., nuraber{4)., 
parti!)., part{2)., part{3)., 

inpart{X, 1) not inpart{X, 2), not inpart{X,3), number{X)., 

_ inpart{X, 2) not inpart{X,l), not inpart{X,3), number{X)., 

^ inpart{X, 3) not inpart{X,l), not inpart{X,2), number{X)., 

number{X), number(Y), part{P), 
inpart(X, P), inpart{Y, P), inpart{Z, P ), 

T = Y+ 1, X <T, Z = X+ Y. 

The results are shown in Table for M = 3. AS reports the number of answer 
sets which are all computed. For all N > 14, Schur-A^ has no answer set. 


The program is a typical “guess and check” program. The seach space is expressed 
by the three rules with inpart as head predicate, and constraint eliminates “bad 
choices”. The grounding of the program is rather small but the search space is large. 
The problem is very easy for Clingo and DLV but very hard for ASPeRiX and GASP. 
Systems using grounding on the fly have to repeat instantiation of the same rules 
in each branch of the search tree. Moreover, constraints are not efficiently man¬ 
aged by systems like ASPeRiX: it does not use constraints for propagation but only 
checks if a constraint is violated. Compared to ASPeRiX, OMiGA performs well for 
computation time, certainly because Rete network improve speed of instantiation 
(partial instantiations are stored in the network) and the network remains rela¬ 
tively small in such an example. This example illustrates a large class of programs 
that ASPeRiX mismanage: programs with many choices and little propagation by 
forward chaining. 


Conversely, the following examples illustrate problems for which grounding on 
the fly is well adapted. 


Birds problem Problem birds is a stratified program encoding a taxonomy about 
flying and non flying birds, b stands for bird, f for flying, nf for nonflying, p for 
penguin, sp for superpenguin, and o for ostrich. 

( sp{X)., b{X)^^p{X)., b{X) o{X)., 'I 

Purds = i f{x) ^ b{X), not p{X), not o{X)., f{X) ^ sp{X)., > 

I nf{X) ^ p{X), not sp{X)., nf{X)^o{X). ) 

We add to this program the atoms encoding N birds with 10% of ostriches, 20% 
of penguins whose half of them are super penguins. 

The unique answer set of such a program can be computed polynomially. ASPeRiX 
uses only propagation step, without choice point, and grounders completely evaluate 
the program so that the solver has nothing to do. Experimental results for birds 
are comparable for ASPeRiX, Clingo, GASP and DLV. ASPeRiX has the best results 
for CPU time, and DLV for memory usage (Figure and [^. For such a problem, 
the number of instantiated rules must be nearly the same for all systems. On the 
other side, OMiGA system uses a very large amount of memory space, certainly due 
to the Rete network which is designed to sacrifice memory for increased speed. 
Unfortunately, memory gains expected by the first order approach are lost. 
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ASPeRiX 

Clingo 

DLV 

OMiGA 

GASP 

N = 1 

AS = 3 

time in sec 

<0.1 

<0.1 

<0.1 

<0.1 

<0.1 




memory in MB 

<2.0 

<2.0 

<2.0 

20.0 

4.4 


N = 2 

AS = & 

time in sec 

<0.1 

<0.1 

<0.1 

<0.1 

<0.1 




memory in MB 

<2.0 

<2.0 

<2.0 

20.0 

5.2 


II 

CO 

AS = 18 

time in sec 

<0.1 

<0.1 

<0.1 

0.1 

0.3 




memory in MB 

<2.0 

<2.0 

<2.0 

20.0 

6.3 


N = i 

AS = 30 

time in sec 

<0.1 

<0.1 

<0.1 

0.2 

1.0 




memory in MB 

<2.0 

<2.0 

<2.0 

24.0 

8.3 


N = 5 

AS = 66 

time in sec 

<0.1 

<0.1 

<0.1 

0.3 

3.6 




memory in MB 

<2.0 

<2.0 

<2.0 

50.0 

8.3 


N = 6 

AS = 120 

time in sec 

<0.1 

<0.1 

<0.1 

0.4 

11.0 




memory in MB 

<2.0 

<2.0 

<2.0 

63.0 

8.3 


II 

AS = 258 

time in sec 

0.3 

<0.1 

<0.1 

0.6 

41.0 




memory in MB 

2.0 

<2.0 

<2.0 

99.0 

7.5 


00 

II 

AS = 288 

time in sec 

1.0 

<0.1 

<0.1 

0.8 

113.0 




memory in MB 

2.0 

<2.0 

<2.0 

100.0 

5.7 


N = 9 

AS = 546 

time in sec 

3.6 

<0.1 

<0.1 

1.3 

370.0 




memory in MB 

2.0 

<2.0 

<2.0 

165.0 

7.5 


o 

II 

AS = 300 

time in sec 

11.1 

<0.1 

<0.1 

1.9 

OoT 




memory in MB 

2.0 

<2.0 

<2.0 

220.0 

- 


AT = 11 

AS = 186 

time in sec 

39.7 

<0.1 

<0.1 

2.8 

OoT 




memory in MB 

2.2 

<2.0 

<2.0 

290.0 

- 


Af = 12 

AS = 114 

time in sec 

131.0 

<0.1 

<0.1 

4.1 

OoT 




memory in MB 

2.2 

<2.0 

<2.0 

290.0 

- 


00 

II 

AS = 18 

time in sec 

448.0 

<0.1 

<0.1 

6.5 

OoT 




memory in MB 

2.2 

<2.0 

<2.0 

285.0 

- 


AT = 14 

AS = 0 

time in sec 

OoT 

<0.1 

<0.1 

11.0 

OoT 




memory in MB 

- 

<2.0 

<2.0 

287.0 

- 



Table 3. Experimental results for Schur 


Cutedge problem cutedge program is proposed in (Dao-Tran et al. 2012): given 
a random graph with 100 vertices and N edges, each answer set is obtained by 
deleting an edge and compute some transitive closure on the remaining edges. 
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Fig. 4. Time for birds 


P, 


cutedge 


delete{X,Y) ^ edge{X,Y), not keep(X,Y)., 
keep{X,Y) ^ edge{X,Y), delete{Xl,Yl), Xll = X., 

= < keep{X,Y) edge{X,Y), delete{Xl,Yl), yi! = Y., 
reachable{X,Y) ^ keep{X,Y)., 

reachable{X, 98) ^ reachable{X, Z), reachable{Z, 98). 

Computing each answer set is only based on propagation, and the number of 
answer sets equals the number of edges. The number of rules needed to compute 
all answer sets is proportional to while the rule number needed to compute 
one is proportional to N. But systems with pregrounding phase must generate all 
ground instances of rules even if only one answer set is required. The results are 
shown in Table ASPeRiX has the best results for this program both for CPU 
time and memory usage. OMiGA and Clingo use much more memory and are much 
slower than ASPeRiX. As expected, memory usage of Clingo is independent of the 
number of answer sets required and is close to the square of that used by ASPeRiX. 
For its part, DLV quickly exceeds the time limit imposed. 


Hamiltonian cycle problem The program (see Example [^, Hamiltonian 
cycle in a complete graph, is another easy problem with a lot of answer sets. Each 
answer set is easy to compute but the whole instantiation is huge. Experiments 
for the computation of one answer set in a graph with N vertices are represented 
in Eigures and ASPeRiX performs well on this example whereas OMiGA has 
time and memory problems similar to that of Clingo and DLV. One more time, a 
simple problem becomes intractable by systems with pregrounding phase because 
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1 

ASPeRiX 

Clingo 

DLV| 

OMiGA 

N 

= 2.SK 

AS = 1 

time in sec 

<0.1 

21 

115 

0.6 




memory in MB 

14.8 

345 

103 

85 



AS = 10 

time in sec I 

0.2 

21 

226 

1.8 




memory in MB | 

14.8 

345 

103 

200 



AS = 100 

time in sec 

2.7 

32 

OoT 

11.4 




memory in MB 

15.1 

345 


1042 



AS = 500 

time in sec 

16 

78 

OoT 

48 




memory in MB 

16.4 

345 


1050 



AS = 1000 

time in sec 

36 

123 

OoT 

84 




memory in MB 

18.0 

345 


1050 



AS = all 

time in sec 

167 

189 

OoT 

165 




memory in MB 

24.1 

345 


1050 

N 

= 4.9K 

AS = 1 

time in sec 

0.1 

60 

325 

1 




memory in MB 

23.7 

881 

144 

177 



AS = 10 

time in sec I 

0.5 

63 

OoT 

3.7 




memory in MB | 

23.8 

881 

■ 1 

425 



AS = 100 

time in sec 

5.8 

94 

OoT 

29 




memory in MB 

24.0 

881 


1100 



AS = 500 

time in sec I 

30.7 

228 

OoT 

125 




memory in MB | 

25.3 

881 

■ 1 

1150 



AS = 1000 

time in sec 

67 

373 

OoT 

245 




memory in MB 

27 

881 


1190 

N 

= 5.9K 

AS = 1 

time in sec 

0.1 

94 

465 

1 




memory in MB 

28.5 

1167 

202 

132 



AS = 10 

time in sec 

0.8 

94 

OoT 

5 




memory in MB 

28.6 

1168 


680 



AS = 100 

time in sec 

7.6 

114 

OoT 

41 




memory in MB 

29.1 

1168 


1135 



AS = 500 

time in sec 

42 

210 

OoT 

192 




memory in MB 

31.3 

1168 


1125 



AS = 1000 

time in sec I 

92 

316 

OoT 

352 




memory in MB | 

34.1 

1168 

■ 1 

1132 


Table 4. Experimental results for cutedge 
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Fig. 5. Space for birds 


they drown it in a lot of useless information so that memory used quickly becomes 
prohibitive. 

Hanoi problem Hanoi example illustrates a planning problem where the max¬ 
imum number of allowed steps is given as input. NbD is the number of disks in 
the problem and NbM is the maximum number of moves that are allowed to move 
all disks from the first rod to the third. The least value of NbM is the minimum 
required to achieve the goal, then its value is gradually increased to evaluate its 
impact. The complete program is given in [Appendix A| and experimental results 
are shown in Table ASPeRiX performances are (almost) independent of the given 
number of moves: search, and therefore grounding, are stopped when a solution is 
found. Conversely, grounders are quickly overwhelmed as they are obliged to fully 
instantiate the program with all hypothetical (and unnecessary in this case) calcu¬ 
lation steps[^ This example cannot be computed by DMiGA due to restrictions on 
the input language it accepts (function symbols are not supported). 

Three coloring problem The program (see Example , 3-coloring problem 
on a graph organized as a bicycle wheel, poses no problem for Clingo and DLV 
(cf. Table . But ASPeRiX and DMiGA have bad results on this example because 
they are mismanaging constraints. Once a vertex is colored, say red, constraint 

^ iClingo iGebser et al. 2008)1 was created to address this specific problem. Some directives are 
added to the program in order to incrementally instantiate some predicates of the program. But 
it does not escape the grounding/solving separation, it only introduces some tools to control 
the process. 
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I ASPeRiX I Cllngo | DLV | 


NbD = 4 


NbD = 5 


NbM = 15 

time in sec 

<0.1 

<0.1 

< 0.1 


memory in MB 

- 

- 

- 

NbM = 60 

time in sec 

<0.1 

0.7 

0.8 


memory in MB 

- 

27 

22 

NbM = 100 

time in sec 

<0.1 

1.5 

3.6 


memory in MB 

- 

54 

51 

NbM = 500 

time in sec 

<0.1 

11.6 

- 


memory in MB 

- 

327 

OoM 

NbM = 1000 

time in sec 

<0.1 

27 

- 


memory in MB 

- 

693 

OoM 

NbM = 2000 

time in sec 

<0.1 

66 

- 


memory in MB 

- 

1523 

OoM 

NbM = 5000 

time in sec 

<0.1 

- 

- 


memory in MB 

- 

OoM 

OoM 

NbM = 10000 

time in sec 

<0.1 

- 

- 


memory in MB 

- 

OoM 

OoM 

NbM = 50000 

time in sec 

0.1 

- 

- 


memory in MB 

12.9 

OoM 

OoM 

NbM = 100000 

time in sec 

0.3 

- 

- 


memory in MB 

23.9 

OoM 

OoM 

NbM = 31 

time in sec 

0.2 

0.1 

0.1 


memory in MB 

3.7 

6.4 

4 

NbM = 50 

time in sec 

0.2 

0.7 

0.9 


memory in MB 

3.7 

28 

31 

NbM = 100 

time in sec 

0.2 

8.2 

9.4 


memory in MB 

3.8 

245 

270 

NbM = 500 

time in sec 

0.2 

91 

- 


memory in MB 

3.8 

2055 

OoM 

NbM = 1000 

time in sec 

0.2 

- 

- 


memory in MB 

3.8 

OoM 

OoM 

NbM = 5000 

time in sec 

0.2 

- 

- 


memory in MB 

4.8 

OoM 

OoM 

NbM = 10000 

time in sec 

0.2 

- 

- 


memory in MB 

5.8 

OoM 

OoM 

NbM = 50000 

time in sec 

0.4 

- 

- 


memory in MB 

14.4 

OoM 

OoM 

NbM = 100000 

time in sec 

0.7 

- 

- 


memory in MB 

25 

OoM 

OoM 


NbD = 6 


NbM 

= 63 

time in sec 

4.7 

0.6 

1.1 



memory in MB 

10.3 

24 

29 

NbM 

= 100 

time in sec 

4.7 

9 

- 



memory in MB 

10.3 

272 

OoM 

NbM 

= 150 

time in sec 

4.7 

83 

- 



memory in MB 

10.3 

1863 

OoM 

NbM 

= 200 

time in sec 

4.7 

- 

- 



memory in MB 

10.3 

OoM 

OoM 

NbM 

= 500 

time in sec 

4.7 

- 

- 



memory in MB 

10.4 

OoM 

OoM 

NbM 

= 1000 

time in sec 

4.7 

- 

- 



memory in MB 

10.5 

OoM 

OoM 

NbM 

= 5000 

time in sec 

4.7 

- 

- 



memory in MB 

11.3 

OoM 

OoM 

NbM 

= 10000 

time in sec 

4.8 

- 

- 



memory in MB 

12.4 

OoM 

OoM 

NbM 

= 50000 

time in sec 

5 

- 

- 



memory in MB 

21 

OoM 

OoM 

NbM 

= 100000 

time in sec 

5.6 

- 

- 



memory in MB 

31.7 

OoM 

OoM 


Table 5. Experimental results for Hanoi tower problem 
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Fig. 6. Time for Hamiltonian cycle 

e{V,U),col{V,C),col{U,C).) prohibits coloring adjacent vertices of the same 
color. In propositional systems, unit propagation (or equivalent) works well and 
allows to infer that adjacent vertices are not colored red. But first-order approach 
does not allow, in general case, to use unit propagation and thus, constraints are 
mainly used for verification and not for propagation. A lot of work remains on 
these points. First-order constraints could instead allow more powerful propagation. 
Suppose for example a constraint p{X, Y),p{Y, Z).) and p(l, 2) is added in IN 
set then, for all Z, p(2, Z) can be excluded at once from current solution, even if Z 
values are potentially inhnite. But these opportunities are not exploited yet. 

To sum up, ASPeRiX is efficient to deal with stratified programs or simple prob¬ 
lems whose instantiation is infinite or huge but much of which is useless to compute 
one specific answer set. On the other hand, the system is not competitive for more 
combinatorial problems, with a large search space and few solutions, because propo¬ 
sitional methods for propagation, heuristics, learning lemmas did not apply to the 
first order case. 


5 Conclusion 

In this paper, we have presented the ASPeRiX approach to answer set computation. 
Our methodology deals with first order rules following a forward chaining with 
grounding process realized on the fly and has been implemented in the ASP solver 
ASPeRiX. This paper is the first comprehensive document in which a survey of the 
important techniques relevant to our approach is presented. 

Starting from a short description of state-of-the-art ASP working principle, we 
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Fig. 7. Space for Hamiltonian cycle 


have presented by many examples the main motivation of our approach: escaping 
the bottleneck of the preliminary phase of grounding in which many state-of-the-art 
systems fall. After a presentation of the theoretical foundations of ASP, we have 
described by an ASPeRiX computation our first order forward chaining approach for 
answer set computing and have established the soundness and completeness of this 
calculus w.r.t. the semantics of ASP (Proofs are reported in Appendix B). We have 
then described in details the main algorithms of ASPeRiX and particularly those 
which realize the selection of the first order rules to be instantiated and applied 
according to the current answer set in construction. 

Our methodology allows very good performances for definite and stratified pro¬ 
grams. It outperforms systems with a pregrounding phase for programs with large 
grounding but much of it is unnecessary to solve the problem. On the other side, 
performances quickly degrade for combinatorial problems with large search spaces, 
especially if forward chaining propagation can not be exploited. 

We have shown that our approach escapes the bottleneck of the preliminary 
phase of grounding that is the only difhculty for some classes of programs. A direct 
consequence of our new approach is that the use of symbolic functions in general 
and arithmetic calculus in particular inside ASP is greatly facilitated. 

The forward chaining with the grounding process realized on the fly as an oper¬ 
ational semantics emphasizes the programming aspect of ASP in which the answer 
set is not only the result of a black box but the result of a process that may be 
followed. This is interesting when dealing with knowledge coming from the web 
and expressed in description logic since the structure of information uses rules that 
are chained ones with the others (whereas this is not always the case for a pro- 
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0.3 
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<2 

1 

<1 

45 


N = 

11 

AS = all{6) 
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3.4 

<0.1 

<0.1 
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memory in MB 

1.8 

1 

<1 

132 


N = 
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AS = 1 
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<0.1 

<0.1 

<0.1 

OoT 





memory in MB 

3 

1.5 

1.1 

- 


N = 
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time in sec 

OoT 

<0.1 

<0.1 

OoT 





memory in MB 

- 

1.8 

1.4 

- 


N = 

501 

AS = 1 

time in sec 

1.6 

<0.1 

<0.1 

OoT 





memory in MB 

8.8 

3.3 

3.3 

- 


N = 

501 

AS = all{6) 

time in sec 

OoT 

<0.1 

0.3 

OoT 





memory in MB 

- 

3.3 

3.3 

- 


N = 

1001 

AS = 1 

time in sec 

13.3 

<0.1 

0.1 

OoT 





memory in MB 

15.9 

5.5 

5.5 

- 


N = 

1001 

AS = all{6) 

time in sec 

OoT 

<0.1 

1.4 

OoT 





memory in MB 

- 

5.5 

5.5 

- 



Table 6. Experimental time results for 3col 


gram encoding a combinatorial problem). Moreover, when dealing with knowledge 
expressed in description logic, one important issue is the ability to query the knowl¬ 
edge base. The grounding process realized on the fly will then allow to focus only 
on the rules useful to find an answer to the query. For this category of programs, 
we think that our approach may be of great interest. 

Furthermore, computing the answer sets of a program is a fundamental goal but 
not an exclusive one. Debugging a program, controlling its behavior, introducing in 
it some features coming from other programming languages may be of great interest 
for ASP. We think that our methodology of answer set computing, guided by the 
rules of the program, is the good starting point towards these new goals. 


The ASPeRiX project is still in progress. Improvements at the algorithmic level 
are underway by the development and implementation of backjumping and clause 
learning techniques. On the other hand, we plan to fully respect the core language 
ASP (Calimeri et al. 20141 by introducing, among others, minimization / maximiza¬ 
tion and aggregates and extend it by introducing existentially quantified variables 
in multi-head rules to encode fragments of Description Logics which are logical 
formalisms for ontologies and the Semantic Web. 
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Tribute 

In memory of the late Pascal Nicolas who was at the origin of this work. He sadly 
passed away in 2010 but his enthusiasm, his passion for research and his great 
humanity are still with us. 
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Appendix A Hanoi example 

The following ASP program is the Hanoi example with 4 discs. 

"/o-Initial settings 

nmnber_of_moves(10000) . 
largest_disc(4). 

7o-Initial state 

initial_state(towers(1(4,1(3,1(2,1(1,nil)))),nil,nil)). 
y„-Goal state 

goal(towers(nil, nil, 1(4,1(3,1(2,1(1,nil)))))). 
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7o-all discs involved- 

disc(l..4). 

y„-legal stacks- 


legalStack(nil). 

legalStack(l(T,nil)) disc(T). 

legalStack(l(T,l(Tl,S))) legalStack(l(Tl,S)), disc(T), T > Tl. 

°L -possible moves- 

possible_state(0,towers(SI,S2,S3)) 

:- initial_state(towers(SI,S2,S3)), 
legalStack(Sl), legalStack(S2), legalStack(S3). 

possible_state(l,towers(SI,S2,S3)) 

possible_move(l,T,towers(Sl,S2,S3)). 

7o From stack one to stack two. 

possible_move(J,towers(l(X,Sl),S2,S3),towers(Sl,l(X,S2),S3)) 
possible_state(1,towers(1(X,SI),S2,S3)), 
number_ofjnoves(N), 1<=N, legalStack(l(X,S2)), J=l+1, not ok(l) 

7o From stack one to stack three. 

possible_move(J,towers(1(X,SI),S2,S3),towers(S1,S2,1(X,S3))) 
possible_state(1,towers(1(X,SI),S2,S3)), 
number_ofjnoves(N), 1<=N, legalStack(l(X,S3)), J=l+1, not ok(l) 

7o From stack two to stack one. 

possible_move(J,towers(Sl,l(X,S2),S3),towers(l(X,Sl),S2,S3)) 
possible_state(l,towers(Sl,l(X,S2),S3)), 
number_of_moves(N), 1<=N, legalStack(l(X,Sl)), J=l+1, not ok(l) 

7o From stack two to stack three. 

possible_move(J,towers(Sl,l(X,S2),S3),towers(Sl,S2,l(X,S3))) 
possible_state(l,towers(Sl,l(X,S2),S3)), 
number_ofjnoves(N), 1<=N, legalStack(l(X,S3)), J=l+1, not ok(l) 

7o From stack three to stack one. 

possible_move(J,towers(SI,S2,1(X,S3)),towers(l(X,Sl),S2,S3)) 
possible_state(l,towers(Sl,S2,l(X,S3))), 
number_of.moves(N), 1<=N, legalStack(l(X,Sl)), J=l+1, not ok(l) 

7o From stack three to stack two. 

possiblej]iove(J,towers(Sl,S2,l(X,S3)) ,towers(S1,1(X,S2) ,S3)) 
possible_state(l,towers(Sl,S2,l(X,S3))), 
number.ofjnoves(N), 1<=N, legalStack(l(X,S2)), J=l+1, not ok(l) 
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7„-actual moves- 

°L a solution exists if and only if there is a "possiblejnove" 
y„ leading to the goal. 

7„ in this case, starting from the goal, we proceed backward 
7o to the initial state to single out the full set of moves. 

7o Choose from the possible moves, 
move(1,towers(SI,S2,S3)) 

goal(towers(SI,S2,S3)), possible_state(l,towers(Sl,S2,S3)). 
ok(l) moved,towers(Sl,S2,S3)) , goal (towers (SI, S2, S3) ) . 

win :- ok(l). 
not win. 

moved, towers (SI, S2 ,S3) ) 

moved,towers(Al,A2,A3)) , 

possible_move(l,towers(SI,S2,S3),towers(Al,A2,A3)), J=l-1, 
not nomove(J,towers(SI,S2,S3)). 

nomove(J,towers(SI,S2,S3)) 

move(1,towers(A1,A2,A3)), 

possible_move(l,towers(SI,S2,S3),towers(Al,A2,A3)), J=l-1, 
not moved,towers(Sl,S2,S3)) . 

7o-precisely one move at each step- 

moveStepld) moved,T). 

legalMoveNumber(1), ok(J), 1<J, not moveStepl(l). 

legalMoveNumber(1) , moved,Tl), moved,T2), T1!=T2. 

legalMoveNumber(0). 

legalMoveNumber(K) 

legalMoveNumber(1) , number_of.moves(J) , 1 < J, K=l+1. 


#hide. 

#show move/2. 


Appendix B Proofs 
B.l Proof of Theorem^ 

Proof, (of Theorem Let P be a normal logic program and X C A. Let us note 
first that if GRp{X) is grounded then Cn{GRp{X)^) = head{GRp{X)). 

If X is an answer set of P then, by a theorem from (Konczak et al. 2006), GRp{X) 
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is grounded and, also according to (Konczak et al. 2006), X = Cn{GRp{X)^). Since 
Cn{GRp{X)^) = head{GRp{X)), we can conclude X = head{GRp{X)). 

Let us now suppose that X = head{GRp{X)) and GRp{X) is grounded. We 
have Gn{GRp{X)^) = head{GRp{X)), then X = Gn{GRp{X)'^) and, according 
to (Konczak et al. 2006), X is an answer set of P. 

□ 


B.2 Proof of Theorem^ 

We first give some material needed in the proof. Auxiliary Lemma [l] is used in the 
proof of Lemma Lemmas and establish completeness and correctness. 

Lemma shows that the generating rules of a program can be ordered so as 
to correspond to the order of application of rules in an ASPeRiX computation. 
Condition (1) says that a rule used at step i is supported at this step. Condition 
(2) says that if a rule is a member of Apro at step i but is used at a later stage j, 
then all rules used at steps between i and j are members of Apro at step i. In other 
words, condition (2) says that propagation is entirely completed before making a 
choice. 

Lemma 1. Let P be a normal logic program and X be an answer set of P. Then, 
there exists an enumeration of GRp{X), the set of generating rules of 

X, such that for all i G [l..n] the following two conditions are satisfied: 

(1) body^{ri) C head{{rk \ k < i}) 

(2) for all j > i, if body^(rj) C head{{rk | fc < f}) and body~(rj) C body~{{rk \ 
k < *}) then body~{ri) C body~{{rk | A: < *}). 

Proof, (of Lemma ^ Let P be a normal logic program and X be an answer set 
of P. By Theorem ij there exists an enumeration of GRp{X) such that 

Vi S [l..u], body~^{^ C head{{rk \ k < i}), i.e. such that condition (1) is satisfied. 
This enumeration can be recursively modified in the following way in order to verify 
condition (2). For each i G [l..n], if satishes (2) then remains at rank i, else 
there exists rj with j > i that falsifies condition (2). In this last case, it suffices to 
swap the two rules in the enumeration to satisfy condition (2) at rank i. □ 

Notation. If P is a normal logic program and {Ri, {INi,OUTi))^Q is a sequence 
of ground rule sets Ri and partial interpretations {INi,OUTi), then denotes 
Apro{P, {INi,OUTi),Ri) and denotes Acho{P, {INi,OUTi), Ri). 

Lemma 2. Let P be a normal logic program and X be an answer set of P. Then 
there exists an ASPeRiX computation that converges to X. 

Proof, (of Lemma[^ Let P be a normal logic program and X be an answer set of P. 
Then, there exists an enumeration (ri)^^^.^ of GRp{X) that satisfies conditions 
(1) and (2) from Lemma 

Let {Ri, {INi, OUTi))°^Q be the sequence defined as follows. 

• Pq = 0 , INo = 0 and OUTq = {T} 
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• 'ii,l < i < n, Ri = Ri-x U {rt}, INi = U {head{ri)} and OUR = 

OUTi^i U body~{n) 

• yi > n, Ri = Ri-i, INi = INi-i and OUR = OUR-i 
For all i G [1 ■•«.], we have: 

(*1) X = head{GRp{X)) (by Theorem[^ 

(*2) IN, = U;=i{^ead(r,)} and IN^ = U~o ^ (by (*1)) 

(*3) OUR = [Sj^ibody~ {vj) and therefore OUR Cl X = 0 (by Definition of 
GRp{X)) 

(*4) Apro{P,{IN„OUR),R,) C GRpiX) 

Property (*4) can be proved as follows. By definition^ ^pro = {r G ground{P) \ 
Ri I body'^{r) C INi and body~{r) C OUR}. And by (^2) and (*3), INi C X and 
OUR n A = 0. Thus C GRpiX). 

We are now able to prove that the sequence {Ri, {INi, OUR))^q is an ASPeRiX 
computation. 

Let us first note that Vi, {INi, OUR) is a partial interpretation since INi H 
OC/Ti = 0 (by (*2) and (*3)). 

Now we prove that Revision principle holds for each i > 1. Let i such that 
1 <i <n, then is such that body^{ri) C head{{rk \ k < i}) = /W_i. Two cases 
are possible. First, if body~{ri) C body~{{rk \ k < i}) = OUR-i, then n G 
and Revision principle holds at rank i. Second, if body~{ri) % body~{{rk | fc < i}) 
then, by definition of enumeration (ri)jgjj^ ^j, there is no rule rj with j > i such 
that body~^{rj) C INi-i and body~{rj) C OUR-i. So Ap^ n GRp{X) = 0. And 
as ApQ C GRp{X) (by (*4)), Ap^ = 0. Moreover, ri is a generating rule, thus 
body~{ri)r\X = 0 and body~{ri)fMNi-i = 0 (since INi-i C X). Thereby G A*^^ 
and Revision principle holds. If i > n. Revision principle trivially holds (Stability). 

At step n, we have /A„ = \J^^i{head{rj)} = X and Rn = Uj=i{^i} = GRp{X). 
^'cho = {^ S ground{P) \ | body'^{r) C X and body~{r) D X = 0}. Thus 

^cho — Convergence principle holds and INac = INn = X. □ 

Lemma 3. Let P be a normal logic program and {Ri, {INi,OUR))'^f^ be an 
ASPeRiX computation for P. Then, IN^o is an answer set of P. 

Proof, (of Lemmaj^ Let {Ri, {INi, OUR))°^q be an ASPeRiX computation for P. 

We first prove that Vi > 0, Vj > i — 1, Ri C GRp{INj). For each rule r*, 
body^{ri) C INi_i and IN set increases monotonically, thus body~^{ri) C INj,\/j > 
i — 1. If S then body~{ri) C OUR-i and OUR-i n INi_i = 0. Since IN 

and OUT sets grow monotonically with an empty intersection, body~{ri) H INj = 
0jVj > i — 1. If Ti G ^ben body~{ri) n INi_i = 0. And, since OUR = 

OUR-i U body~{ri), we have Vj > i,body~ {rx) C OUTj, and thus, with the same 
reasonning as above {ri G Ap^), body~{ri) n INj = 0, Vj > i — I. 

Ri = Ufc=i{^fc} and, since 'dj > k — I, r^ G GRp{INj), G GRp{INi). Thus 
R, C GRp{INi). 

By Convergence principle we have 3z, A),^^ = {r G ground{P) \ Ri \ body'^{r) C 
INi and body~{r)C\INi = 0} = 0, then GRp{INi) C Ri. Since Vi, Ri C GRp{INi), 
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GRp{INi) = Ri. And INi = head{Ri) (by definition of an ASPeRiX computation), 
thus INi = head{GRp{INi)). 

Moreover, for all i > 0, body^{ri) C INt-i, thus Ri = Ul=i{’'fe} is grounded 
and, since Ri = GRp{INi), GRp{INi) is grounded. Finally, by Theorem]^ INi is 
an answer set of P. 

□ 

Proof, (of Theorem]^ Lemmas]^ andprove each one direction of the equivalence. 

□ 


B.3 Proof of Theorem^ 

Lemmas 1^ and establish completeness and correctness. 

Lemma 4. Let P be a normal logic program and X be an answer set for P. Then 
there exists a mbt ASPeRiX computation for P that converges to X. 

Proof, (of Lemma Let P be a normal logic program and X an answer set for 
P. By Theorem]^ there exists an ASPeRiX computation {Ri, {INi,OUTi))^Q with 
IN^ = X. Let G = {Ki, R„ {INi, MBTi, OUTi))"^^ with Ki = MBT, = 0, Vi > 0. 
G is clearly a mbt ASPeRiX computation for P where “Rule exclusion” is never used 
and thus “Mbt-propagation” is not used either. □ 

Lemma 5. Let P be a normal logic program and {Ki, Ri, {INi, MBTi, OUTi))'^^^ 
be a mbt ASPeRiX computation for P. Then IN^o is an answer set of P. 

Proof, (of Lemma § Let {Ki,Ri,{INi,MBTi,OUTi))^Q a mbt ASPeRiX compu¬ 
tation for P. Then G = {Ri, {INi,OUTi))^Q is an ASPeRiX computation for P: 
it satisfies Revision principles of an ASPeRiX computation and it trivially satisfies 
Convergence too. By Theorem]^ G converges to an answer set IN^q. □ 

Proof, (of Theorem]^ Lemmas]^ andprove each one direction of the equivalence. 

□ 


